Abstract. The celebrated Marchenko-Pastur theorem gives the asymptotic spectral dis- 
tribution of sums of random, independent, rank-one projections. Its main hypothesis is 
that these projections are more or less uniformly distributed on the first grassmannian, 
which implies for example that the corresponding vectors are delocalized, i.e. are essen- 
tially supported by the whole canonical basis. In this paper, we propose a way to drop this 
delocalization assumption and we generalize this theorem to a quite general framework, 
including random projections whose corresponding vectors are localized, i.e. with some 
components much larger than the other ones. The first of our two main examples is given 
by heavy tailed random vectors (as in the model introduced by Ben Arous and Guionnet in 
[5] or as in the model introduced by Zakharevich in [32], where the moments grow very fast 
as the dimension grows). Our second main example, related to the continuum between the 
classical and free convolutions introduced in [11], is given by vectors which are distributed 
as the Brownian motion on the unit sphere, with localized initial law. Our framework 
is in fact general enough to get new correspondences between classical infinitely divisible 
laws and some limit spectral distributions of random matrices, generalizing the so-called 
Bercovici-Pata bijection. 



1. Introduction 

In 1967, Marchenko and Pastur introduced a successful matrix model inspired by the 
elementary fact that each Hermitian matrix is the sum of orthogonal rank-one homotheties. 
Substituting orthogonality with independence, they considered in their seminal paper 
the $N \times N$ random matrix defined by 

(1) $$\frac{1}{N} \sum_{i=1}^{p} X_i \cdot U_N^i (U_N^i)^*,$$

where $(X_i)_{i \ge 1}$ is an i.i.d. sequence of real valued random variables and $(U_N^i)_{i \ge 1}$ is an i.i.d. 
sequence of $N$-dimensional column vectors, whose conjugate transposes are denoted $(U_N^i)^*$, 
independent of $(X_i)_{i \ge 1}$. As a main result, they proved that the empirical spectral measure 
of this matrix converges to a limit with an explicit characterization under the following 
assumptions: 

(a) $N, p$ tend to infinity in such a way that $p/N \to \lambda > 0$; 

(b) the first four joint moments of the entries of $U_N^i$ are not too far, roughly speaking, 
from the ones of the entries of a standard Gaussian vector. 

In the special case where all $X_i$'s are equal to one, the matrix (1) reduces to a so-called 
empirical covariance matrix, and its limit spectral distribution is none other than the 
well-known Marchenko-Pastur distribution with parameter $\lambda$. 

It has to be noticed that even in the general case, the limit spectral distribution does 
not depend on the particular choice of the $U_N^i$'s, granted they satisfy the above hypothesis. 
For example, one can choose $U_N^i$ to have uniform distribution on the sphere of $\mathbb{R}^N$ or $\mathbb{C}^N$ 
with radius $\sqrt{N}$, or to be a standard Gaussian: such a vector is said to be delocalized, which 
means that with large probability, $\|U_N\|_\infty / \|U_N\|_2$ is small; more specifically: 

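$$\frac{\|U_N\|_\infty}{\|U_N\|_2} \approx \sqrt{\frac{\log(N)}{N}} \quad \text{(up to a multiplicative constant)}.$$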

After the initial paper of Marchenko and Pastur, a long list of further-reaching results 
about the limit spectral distribution of empirical covariance matrices has been obtained, by 
Yin and Krishnaiah [31], Götze and Tikhomirov [20, 21], Aubrun [3], Pajor and Pastur 
[25], Adamczak [1]. All of them are devoted to the empirical covariance matrix of more or 
less delocalized vectors, with limit spectral distribution being the Marchenko-Pastur 
distribution (except in the case treated in [31], but there the vectors are still very delocalized). 



In this paper, our goal is to drop the delocalization assumption and to be able to deal 
with localized $U_N^i$'s, i.e. with some entries much larger than the other ones. For example, 
the applications of our main theorem include the case where the entries of $U_N^i$ have heavy 
tails, but also some other examples of localized vectors, such as the one where the law 
of $U_N^i$ results from a Brownian motion with localized initial condition. 

This approach is based on our preceding works [6, 16] on the Bercovici-Pata bijection 
(see also [U ID]). This bijection, that we denote by $\Lambda$, is a correspondence between 
the probability measures on the real line that are infinitely divisible with respect to the 
classical convolution $*$ and the ones which are infinitely divisible with respect to the free 
convolution $\boxplus$. In [6, 16], we constructed a set of matrix ensembles which produces a quite 
natural interpretation of $\Lambda$. This construction is easy to describe for compound Poisson 
laws and makes the connection with Marchenko-Pastur's model quite clear. Let $(X_i)_{i \ge 1}$ 
be still an i.i.d. sequence of real valued random variables, $(U_N^i)_{i \ge 1}$ i.i.d. column vectors 
uniformly distributed on the sphere with radius $\sqrt{N}$, and $(P(\lambda), \lambda \ge 0)$ a standard 
Poisson process, $(X_i)_{i \ge 1}$, $(U_N^i)_{i \ge 1}$ and $(P(\lambda), \lambda \ge 0)$ being independent. For each $N \ge 1$, 
we defined the random matrix 

(2) $$\frac{1}{N} \sum_{i=1}^{P(N\lambda)} X_i \cdot U_N^i (U_N^i)^*$$

and we proved that its empirical spectral law converges to $\Lambda(\mu)$ when $N$ goes to infinity, 
where $\mu$ is the compound Poisson law of 

$$\sum_{i=1}^{P(\lambda)} X_i.$$
The link with Marchenko-Pastur's model (1) is now obvious and it is easy to verify 
that the empirical spectral laws of (1) and (2) have the same limit if $p \sim N\lambda$. Hence, our 
previous works [6, 16] could be viewed as another insight on Marchenko-Pastur's results, 
partly more restricted, since we only considered uniformly distributed random vectors $U_N^i$, 
partly more general, since our construction extended to all infinitely divisible laws. In 
fact, the main advantage of our matricial model, which has also been studied in [26], over 
Marchenko-Pastur's one is to be infinitely divisible. It allows us to derive simpler proofs, 
using appropriate tools such as cumulant computations or semi-groups. 

In the present paper, we extend our construction to a larger class of $U_N^i$'s, while 
continuing to benefit from the infinitely divisible framework. Roughly speaking, we are able to 
prove the convergence of the empirical spectral law if we suppose only that the entries of 
$U_N^i$ are exchangeable and have a moment of order $k$ growing as $N \to \infty$ at most in $N^{k/2-1}$, 
for any fixed $k$ (cf. Theorem 2.6). 

Then, approximations allow to extend the result to $U_N^i$ with heavy tailed entries (see 
Theorem 3.1). When the $X_i$'s are constant, we recover a result obtained by Belinschi, 
Dembo and Guionnet in [4]. Our result is more general from a certain point of view 
(we allow some random $X_i$'s), but less explicit since we characterize the limit spectral 
distribution as the weak limit of a sequence of probability distributions with calculable 
moments and not with a functional equation, as in [4]. We also state (Theorem 3.2) a 
"covariance matrices version" of Zakharevich's generalization of Wigner's theorem of [32], 
which is a direct consequence of our key result Theorem 2.6. 

We also devote a particular interest to the special case where the column vectors $U_N^i$'s 
are copies of 

(3) $$U_N(t) = \sqrt{N}\, e^{-t/2} X_N + \sqrt{1 - e^{-t}}\, G_N,$$

where $X_N$ is uniformly distributed on the canonical basis $(1, 0, \ldots, 0)^T, \ldots, (0, \ldots, 0, 1)^T$, 
independent of $G_N := (g_1, \ldots, g_N)^T$, with $g_1, g_2, g_3, \ldots$ independent standard Gaussian 
variables. Let us emphasize that $U_N(t)$ is a quite typical localized vector since it has 






FLORENT BENAYCH-GEORGES AND THIERRY CABANAL-DUVILLARD 



exactly one entry which is much bigger than the $N - 1$ others. Precisely, we have: 

$$\frac{\|U_N(t)\|_\infty}{\|U_N(t)\|_2} \approx e^{-t/2}.$$

This is the reason why the $N \to \infty$ limits differ from the ones of the classical matrix 
models. For example, the classical Marchenko-Pastur Theorem about empirical covariance 
matrices is not true anymore for such vectors. More specifically, we have: 

Theorem 1.1. Let $U_N^1, U_N^2, \ldots$ be i.i.d. copies of the vector $U_N(t) \in \mathbb{R}^{N \times 1}$ defined in (3). 
Let 

(4) $$M := \frac{1}{N} \sum_{i=1}^{p} U_N^i (U_N^i)^*$$

be the (dilated) empirical covariance matrix of the sample $U_N^1, \ldots, U_N^p$. As $N, p \to \infty$ with 
$p/N \to \lambda > 0$, the empirical spectral law of $M$ converges to a limit law $P_{\lambda,t}$ with unbounded 
support which is characterized by its moments, given by the formula 

(5) $$\int x^k P_{\lambda,t}(dx) = \sum_{\pi \in \mathrm{Part}(k)} e^{-\kappa(\pi) t} \lambda^{|\pi|},$$

with $\mathrm{Part}(k)$ the set of partitions of $\{1, \ldots, k\}$ and $\kappa$ defined by: 

• $\kappa(\pi) = \kappa(\pi_1) + \cdots + \kappa(\pi_q)$ if $\pi_1, \ldots, \pi_q$ are the connected components of $\pi$; 

• if $\pi$ is connected, $\kappa(\pi)$ is the number of times one changes from one block to another 
when running through $\pi$ in a cyclic way. 



Note that the law $P_{\lambda,t}$ is the Poisson law with parameter $\lambda$ if $t = 0$ and tends to the 
free Poisson law (also called Marchenko-Pastur law) as $t$ tends to $+\infty$: the weight $e^{-\kappa(\pi) t}$ 
penalizes crossings in partitions (indeed, $\kappa(\pi) = 0$ if and only if $\pi$ is non crossing). An 
illustration is given in Figure 1 below. In fact, Formula (5) can be generalized into a 
more general moments-cumulants formula which provides a new continuous interpolation 
between classically and freely infinitely divisible laws. This interpolation is related to the 
notion of t-freeness developed by the first named author and Lévy in [11] and is based on 
a progressive penalization of the crossings in the moments-cumulants formula (19). 
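
The localization of the vector $U_N(t)$ of (3) can be checked numerically. The following is only a minimal sketch, using the Python standard library: the function names, the dimension N = 4000, the value t = 1 and the seed are illustrative choices of ours, not taken from the paper. It samples a few copies of $U_N(t)$ and compares the ratio of the sup norm to the Euclidean norm with $e^{-t/2}$.

```python
import math
import random

def sample_localized_vector(n, t, rng):
    """Draw U_N(t) = sqrt(N) e^{-t/2} X_N + sqrt(1 - e^{-t}) G_N, as in (3)."""
    spike = rng.randrange(n)  # index of the canonical basis vector X_N
    gauss_scale = math.sqrt(1.0 - math.exp(-t))
    u = [gauss_scale * rng.gauss(0.0, 1.0) for _ in range(n)]
    u[spike] += math.sqrt(n) * math.exp(-t / 2.0)
    return u

def localization_ratio(u):
    """Ratio of the sup norm to the Euclidean norm of the vector u."""
    sup_norm = max(abs(x) for x in u)
    l2_norm = math.sqrt(sum(x * x for x in u))
    return sup_norm / l2_norm

rng = random.Random(0)  # fixed seed, illustrative only
n, t = 4000, 1.0        # hypothetical parameters
ratios = [localization_ratio(sample_localized_vector(n, t, rng)) for _ in range(20)]
mean_ratio = sum(ratios) / len(ratios)
print(mean_ratio, math.exp(-t / 2.0))
```

For a delocalized vector (for instance $t = +\infty$, where $U_N(t) = G_N$) the same ratio would instead tend to zero as $N$ grows.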

The paper is organized as follows. Next section describes the main results; Section 3 
provides some applications; the following sections are devoted to the proofs; the last section 
is an appendix where we recall some facts on the infinitely divisible laws, the Bercovici-Pata 
bijection and (hyper)graphs, for the convenience of the reader. 

As this preprint was published, Victor Pérez-Abreu informed us that he is working on 
a close subject with J. Armando Domínguez-Molina and Alfonso Rocha-Arteaga in the 
forthcoming preprint [15]. 

[Figure 1, three panels: (a) Case where t = 0.01; (b) Case where t = 0.1; (c) Case where t = 1.]

Figure 1. The empirical spectral distribution of the matrix $M$ of (4) for N = 
4.10^, p = 8.10^ at several values of $t$. We see that this distribution, which is 
approximately the law $P_{2,t}$, is close to the Poisson law with parameter 2 for $t$ close 
to zero, and that it converges to the Marchenko-Pastur law with parameter 2 
(whose density is plotted by a smooth continuous line) as $t$ grows. 

Acknowledgements. The authors thank James Mingo for a useful explanation during 
the program "Bialgebras in free probability" in the Erwin Schrödinger Institute in Wien 
during spring 2011. They also thank Thierry Lévy and Camille Male for some discussions 
on a preliminary version of the paper. 



2. The main results 

2.1. A general family of matrix ensembles. The basic facts on the $*$-infinitely divisible 
laws are recalled in the appendix. For $\mu$ such a law, its Lévy exponent is the function $\Psi_\mu$ 
such that the Fourier transform of $\mu$ is $e^{\Psi_\mu(\cdot)}$. 

2.1.1. Case of compound Poisson laws. Such a law $\mu$ is the one of 

$$\sum_{i=1}^{P(\lambda)} X_i,$$

where $P(\lambda)$ is a random variable with Poisson law with expectation $\lambda$ and the $X_i$'s are 
i.i.d. random variables, independent of $P(\lambda)$. In this case, if $\nu$ denotes the law of the $X_i$'s, 
the Lévy exponent of $\mu$ is $\Psi_\mu(\xi) = \lambda \int (e^{it\xi} - 1)\, d\nu(t)$. For $N \ge 1$, let $(U_N^i)_{i \ge 1}$ be a sequence 
of i.i.d. copies of a random column vector $U_N \in \mathbb{R}^{N \times 1}$ or $\mathbb{C}^{N \times 1}$. Then we define $\mathbb{P}^{\mu}_{U_N}$ to be 
the law of 

$$\frac{1}{N} \sum_{i=1}^{P(N\lambda)} X_i \cdot U_N^i (U_N^i)^*,$$

where $P(N\lambda)$ is a random variable with Poisson law with expectation $N\lambda$, the $X_i$'s are 
i.i.d. random variables with law $\nu$ and the $U_N^i$'s are i.i.d. copies of $U_N$ (whose conjugate 
transposes are denoted by $(U_N^i)^*$), all being independent. Let us notice that $\mathbb{P}^{\mu}_{U_N}$ is still a 
compound Poisson law and that its Lévy exponent is given by: 

$$A \longmapsto N\, \mathbb{E}\left[\Psi_\mu(U_N^* A U_N / N)\right]$$

for any $N \times N$ Hermitian matrix $A$. 
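
To make the compound Poisson construction concrete, here is a small simulation sketch in plain Python. It is not taken from the paper: the helper names, the choice of real vectors uniformly distributed on the sphere of radius $\sqrt{N}$, the constant weights $X_i = 1$, and the parameters N = 30, $\lambda$ = 2 are all assumptions made for illustration. With these choices each term $U_N^i (U_N^i)^*$ has trace $N$, so the normalized trace of the sampled matrix equals the number of terms divided by $N$.

```python
import math
import random

def poisson(mean, rng):
    """Sample a Poisson variable by counting unit-rate exponential arrivals before `mean`."""
    count, clock = 0, rng.expovariate(1.0)
    while clock < mean:
        count += 1
        clock += rng.expovariate(1.0)
    return count

def uniform_on_sphere(n, rng):
    """Real vector uniformly distributed on the sphere of radius sqrt(n)."""
    g = [rng.gauss(0.0, 1.0) for _ in range(n)]
    norm = math.sqrt(sum(x * x for x in g))
    return [math.sqrt(n) * x / norm for x in g]

def sample_matrix(n, lam, draw_x, rng):
    """One draw of (1/N) * sum_{i <= P(N*lam)} X_i U_i U_i^T; returns (matrix, number of terms)."""
    terms = poisson(n * lam, rng)
    m = [[0.0] * n for _ in range(n)]
    for _ in range(terms):
        x, u = draw_x(rng), uniform_on_sphere(n, rng)
        for a in range(n):
            for b in range(n):
                m[a][b] += x * u[a] * u[b] / n
    return m, terms

rng = random.Random(1)  # fixed seed, illustrative only
n, lam = 30, 2.0        # hypothetical parameters
matrix, terms = sample_matrix(n, lam, lambda r: 1.0, rng)
normalized_trace = sum(matrix[a][a] for a in range(n)) / n
print(terms, normalized_trace)
```

Replacing the constant weight by another sampler (for instance `lambda r: r.gauss(0.0, 1.0)`) gives a draw of the same model for a different compound Poisson law.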



2.1.2. General case. Here, we shall extend the previous construction for a general 
infinitely divisible law. In the following theorem and in the rest of the paper, the spaces of 
probability measures are endowed with the weak topology. $\mathbb{K}$ denotes either $\mathbb{R}$ or $\mathbb{C}$. 

Theorem 2.1. Let $\mu$ be an infinitely divisible law on $\mathbb{R}$, let us fix $N \ge 1$ and let $U_N \in \mathbb{K}^{N \times 1}$ 
be a random column vector such that $\mathbb{E}[\|U_N\|_2] < +\infty$. Let $(U_N^i)_{i \ge 1}$ be a sequence of i.i.d. 
copies of $U_N$ and, for each $n \ge 1$, let $X_n^1, X_n^2, \ldots$ be i.i.d. $\mu^{*1/n}$-distributed random 
variables. Then the sequence of $N \times N$ Hermitian matrices 

(6) $$\frac{1}{N} \sum_{i=1}^{N \times n} X_n^i \cdot U_N^i (U_N^i)^*$$

converges in distribution, as $n \to \infty$ ($N$ being fixed), to a probability measure $\mathbb{P}^{\mu}_{U_N}$ on the 
space of $N \times N$ Hermitian matrices, whose Fourier transform is given by 

(7) $$\int e^{i \mathrm{Tr}(AM)}\, d\mathbb{P}^{\mu}_{U_N}(M) = \exp\left\{ N\, \mathbb{E}\left[\Psi_\mu(U_N^* A U_N / N)\right] \right\}$$

for any $N \times N$ Hermitian matrix $A$. 



The following proposition extends the theorem to a quite more general framework, where 
the $X_n^i$'s are i.i.d. but not necessarily distributed according to $\mu^{*1/n}$ and only satisfy the 
limit theorem 

$$\text{law of } (X_n^1 + \cdots + X_n^{k_n}) \underset{n \to \infty}{\longrightarrow} \mu$$

for a sequence $k_n \to \infty$. 

Proposition 2.2. a) If the law of $U_N$ is compactly supported, then to any limit theorem 

$$\underbrace{\nu_n * \cdots * \nu_n}_{k_n \text{ times}} \underset{n \to \infty}{\longrightarrow} \mu$$

there corresponds a limit theorem in the space of $N \times N$ Hermitian matrices 

(8) $$\frac{1}{N} \sum_{i=1}^{N \times k_n} X_n^i \cdot U_N^i (U_N^i)^* \underset{n \to \infty}{\longrightarrow} \mathbb{P}^{\mu}_{U_N} \quad \text{(in law)},$$

where $X_n^1, X_n^2, \ldots, X_n^{N \times k_n}$ are i.i.d. $\nu_n$-distributed random variables, $(U_N^i)_{i \ge 1}$ is a sequence 
of i.i.d. copies of $U_N$, independent of the $X_n^i$'s. 

b) In the case where the law of $U_N$ is not compactly supported, (8) stays true as long as 
one supposes that $k_n \times \int |t|\, \nu_n(dt)$ is bounded uniformly in $n$. 

Remark 2.3. Such a construction has been generalized to the more general setting of 
Hopf algebras by Schürmann, Skeide and Volkwardt. 


2.2. Convergence of the empirical spectral law. The empirical spectral law of a 
matrix is the uniform probability measure on its eigenvalues (see the appendix). 
When $U_N/\sqrt{N}$ is uniformly distributed on the unit sphere, we proved that the 
empirical spectral law associated to $\mathbb{P}^{\mu}_{U_N}$ converges when the size $N$ tends to infinity (cf. 
[6, 16]). In order to obtain other convergences, we shall first make the following 
assumptions on the random vector $U_N \in \mathbb{K}^{N \times 1}$. We denote the entries of $U_N$ by $U_N(1), \ldots, U_N(N)$. 
Roughly speaking, these assumptions mean that the $U_N(i)$'s are exchangeable and that the 
moment of order $k$ of $U_N(i)$ grows at most in $N^{k/2-1}$, for all $k$. These assumptions will be 
weakened in the next section to consider heavy tailed variables. 

Hypothesis 2.4. a) For each $N \ge 1$, the entries of $U_N$ are exchangeable and have moments 
of all orders. 

b) As $N$ goes to infinity, we have: 

• for each $p \ge 1$, for each $i_1, \ldots, i_{2p} \ge 1$, 

(9) nUN{^l) ■ ■ ■ Un{ip)Un{ip+i) ■ ■ ■ = 0{NP'^^'^'-'''^^^), 

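• for each $k \ge 1$, for all positive integers $n_1, \ldots, n_k$, there exists $\Gamma(n_1, \ldots, n_k)$ finite 
such that 

(10) $$\frac{\mathbb{E}\left[|U_N(1)|^{2n_1} \cdots |U_N(k)|^{2n_k}\right]}{N^{n_1 + \cdots + n_k - k}} \longrightarrow \Gamma(n_1, \ldots, n_k).$$

Moreover, there is a constant $C$ such that for all $k \ge 1$, for all $n_1, \ldots, n_k$, 

(11) $$\Gamma(n_1, \ldots, n_k) \le C^{n_1 + \cdots + n_k}.$$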

Remark 2.5. Let us define the random column vector $\tilde{U}_N = \mathrm{diag}(\epsilon_1, \ldots, \epsilon_N) U_N$, with 
$\epsilon_1, \ldots, \epsilon_N$ some i.i.d. variables uniformly distributed on $\{z \in \mathbb{K};\ |z| = 1\}$, independent 
of $U_N$. Note that if $U_N$ satisfies Hypothesis 2.4, so does $\tilde{U}_N$, with the same function $\Gamma$. 
Moreover, the expectation of (9) is null for $\tilde{U}_N$ as soon as the multisets $\{i_1, \ldots, i_k\}$ and 
$\{i_{k+1}, \ldots, i_{2k}\}$ are not equal (in the complex case) and as soon as an element in the multiset 
$\{i_1, \ldots, i_{2k}\}$ appears an odd number of times (in the real case). 



The following theorem is the key result of the paper. For its second part, we endow the 
set of functions on multisets of integers (it is the set that $\Gamma$ belongs to) with the product 
topology. 



Theorem 2.6. We suppose that Hypothesis 2.4 holds. 

a) For any $*$-infinitely divisible distribution $\mu$, the empirical spectral distribution of a 
$\mathbb{P}^{\mu}_{U_N}$-distributed random matrix converges almost surely, as $N \to \infty$, to a deterministic 
probability measure $\Lambda_\Gamma(\mu)$, which depends only on $\mu$ and on the function $\Gamma$ of Equation (10). 

b) The probability measure $\Lambda_\Gamma(\mu)$ depends continuously on the pair $(\mu, \Gamma)$. 

c) The moments of $\Lambda_\Gamma(\mu)$ can be computed when $\mu$ has moments of all orders 
via the following formula, 

(12) $$\int x^k \Lambda_\Gamma(\mu)(dx) = \sum_{\pi \in \mathrm{Part}(k)} f_\Gamma(\pi) \prod_{J \in \pi} c_{|J|}(\mu), \qquad (k \ge 1)$$

where the non negative numbers $f_\Gamma(\pi)$, which factorize along the connected components of 
$\pi$, are given at Lemma WT^ and the numbers $c_{|J|}(\mu)$ are the cumulants of $\mu$ (whose definition 
is recalled in the appendix). 

d) If the Lévy measure of $\mu$ has compact support, then $\Lambda_\Gamma(\mu)$ admits exponential moments 
of all orders. 



The following proposition allows to assert that many limit laws obtained in Theorem 2.6 
have unbounded support. Recall that a law is said to be non degenerate if it is not a Dirac 
mass. 

Proposition 2.7. If the function $\Gamma$ of Hypothesis 2.4 is such that $\inf_n \Gamma(n)^{1/n} > 0$, then 
for any non degenerate $*$-infinitely divisible distribution $\mu$ whose classical cumulants are 
all non negative, the law $\Lambda_\Gamma(\mu)$ has unbounded support. 



3. Applications and examples 



3.1. Heavy-tailed Marchenko-Pastur theorem. In this section, we use Theorem 2.6 
to extend the theorem of Marchenko and Pastur described in the introduction to vectors 



MARCHENKO-PASTUR AND BERCOVICI-PATA GENERALIZED 






with heavy-tailed entries. This also extends Theorem 1.10 of Belinschi, Guionnet and 
Dembo in [4] (their theorem corresponds to the case $X_i = 1$), but we do not provide the 
explicit characterization of the limit that they propose. Also, our result is valid in the 
complex as in the real case, whereas the one of [4] is only stated in the real case (but we 
do not know whether this is an essential restriction of the approach of [4]). 

Let us fix $\alpha \in (0, 2)$ and let $(Y_{ij})_{i,j \ge 1}$ be an infinite array of i.i.d. $\mathbb{K}$-valued random 
variables such that the function 

(13) $$L(t) := t^{\alpha}\, \mathbb{P}(|Y_{11}| > t)$$

has slow variations as $t \to \infty$. For each $j$, define the column vector $V_j := (Y_{ij})_{i=1}^{N} \in \mathbb{K}^{N \times 1}$ 
($V_j$ depends implicitly on $N$). Let us define $a_N := \inf\{t;\ t^{-\alpha} L(t) \le 1/N\}$. 

Theorem 3.1. For any set $(X_j)_{j \ge 1}$ of i.i.d. real random variables (with any law, but that 
does not depend on $N$) independent of the $Y_{ij}$'s and any fixed $\lambda > 0$, the empirical spectral 
law of the random matrix 

$$\frac{1}{a_N^2} \sum_{j=1}^{p} X_j \cdot V_j V_j^*$$

converges almost surely, as $N, p \to \infty$ with $p/N \to \lambda$, to a deterministic probability 
measure which depends only on $\alpha$, on $\lambda$ and on the law of the $X_j$'s. This limit law depends 
continuously on these three parameters. 



3.2. Covariance matrices with exploding moments. The following theorem is a direct 
consequence of Theorem 2.6 and Proposition 2.7. It is the "covariance matrices version" 
of Zakharevich's generalization of Wigner's theorem to matrices with exploding moments 
(see [32] and also the recent work [22] by Male). 

Theorem 3.2. Let $M_{N,p} = [X_{ij}]$ be a complex $N \times p$ random matrix with i.i.d. centered 
entries whose distribution might depend on $N$ and $p$. We suppose that there is a sequence 
$c = (c_k)_{k \ge 2}$ such that $c_k^{1/k}$ is bounded and such that for each fixed $k \ge 2$, 

$$\frac{\mathbb{E}\left[|X_{11}|^k\right]}{N^{k/2 - 1}} \longrightarrow c_k \quad \text{as } N, p \to \infty \text{ with } p/N \to \lambda > 0.$$

Then the empirical spectral law of 

$$\frac{1}{N} M_{N,p} M_{N,p}^*$$

converges, as $N, p \to \infty$ with $p/N \to \lambda > 0$, to a probability measure $\mu_{\lambda,c}$ which depends 
continuously on the pair $(\lambda, c)$. If $c_k = 0$ for all $k \ge 3$, $\mu_{\lambda,c}$ is the Marchenko-Pastur 
distribution with parameter $\lambda$ dilated by a coefficient $\sqrt{c_2}$. Otherwise, $\mu_{\lambda,c}$ has unbounded support 
but admits exponential moments of all orders. 

3.3. Non centered Gaussian vectors, Brownian motion on the unit sphere and a 
continuum of Bercovici-Pata's bijections. In this section, we give examples of vectors 
$U_N$ which satisfy Hypothesis 2.4 for a certain function $\Gamma$. These examples will allow to 
construct the continuum of Bercovici-Pata bijections $(\Lambda_t)_{t \ge 0}$ mentioned in the introduction. 

Let us fix $t \ge 0$ and consider $U_N = U_{N,t}$ defined thanks to one of the two (or three, 
actually) following definitions 

(a) Gaussian and uniform cases: 

(14) $$U_{N,t} := e^{-t/2} \sqrt{N}\, X_N + \sqrt{1 - e^{-t}}\, G_N,$$

where $X_N$ is uniformly distributed on the canonical basis $(1, 0, \ldots, 0)^T, \ldots, (0, \ldots, 0, 1)^T$ 
of $\mathbb{K}^{N \times 1}$, independent of $G_N := (g_1, \ldots, g_N)^T$, with $g_1, g_2, g_3, \ldots$ i.i.d. variables 
whose distribution is either the centered Gaussian law on $\mathbb{K}$ with variance 1 or the 
uniform law on $\{z \in \mathbb{K};\ |z| = 1\}$, 



(b) Brownian motion: $U_{N,t}/\sqrt{N}$ is a Brownian motion on the unit sphere of $\mathbb{C}^{N \times 1}$ taken 
at time $t$, whose distribution at time zero is the uniform law on the canonical basis 
of $\mathbb{C}^{N \times 1}$. Such a process is a strong solution of the SDE 

(15) $$dU_{N,t} = (dK_t)\, U_{N,t} - \frac{1}{2} U_{N,t}\, dt,$$

where $K = (K_t)_{t \ge 0}$ is a standard Brownian motion on the Euclidean space of $N \times N$ 
skew-Hermitian matrices, endowed with the scalar product $A \cdot B = N\, \mathrm{Tr}(A^* B)$. 

Of course, these models make sense for $t = +\infty$: in the first model, it means only that 
$U_{N,t} = G_N$ and the second model, at $t = +\infty$, can be understood as its limit in law, i.e. a 
random vector with uniform law on the sphere with radius $\sqrt{N}$. For $t = +\infty$, all formulas 
below make sense (and stay true) by taking their $t \to +\infty$ limits. 

Thanks to the results of it can be seen that the Gaussian model and the Brownian 
one have approximately the same finite-dimensional marginals, as $N \to \infty$. The following 
proposition, whose proof is postponed to Section 9, makes this analogy stronger. 

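Proposition 3.3. For $U_N = U_{N,t}$ as defined according to any of the above models, 
Hypothesis 2.4 holds for $\Gamma := \Gamma_t$ given by the following formulas: 

(16) $$\Gamma_t(\underbrace{1, \ldots, 1}_{k \text{ times}}) = (1 - e^{-t})^k + k e^{-t} (1 - e^{-t})^{k-1} \quad \text{for every } k \ge 1,$$

(17) $$\Gamma_t(n, \underbrace{1, \ldots, 1}_{k \text{ times}}) = e^{-nt} (1 - e^{-t})^k \quad \text{for every } n \ge 2 \text{ and } k \ge 0,$$

(18) $$\Gamma_t(n_1, \ldots, n_k) = 0 \quad \text{if there is } i \ne j \text{ such that } n_i \ge 2,\ n_j \ge 2.$$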

Let us now consider the family of transforms $(\Lambda_{\Gamma_t}, t \ge 0)$ (denoted by $\Lambda_t$ in the 
introduction). 

In both models, when $t = 0$, the rank-one random projector $U_{N,t} U_{N,t}^*$ is a diagonal matrix 
with unique non zero entry uniformly distributed on the diagonal and equal to $N$. In such a 
case, it can be readily seen that $\mathbb{P}^{\mu}_{U_N}$ is the law of a diagonal matrix with i.i.d. entries with 
law $\mu$. Owing to the law of large numbers, $\Lambda_{\Gamma_0}$ is obviously the identity map. Moreover, it 
has been seen in [6, 16] that for $t = +\infty$, $\Lambda_{\Gamma_\infty}$ is the Bercovici-Pata bijection. 

Therefore, $(\Lambda_{\Gamma_t}, t \in [0, +\infty])$ provides a continuum of maps passing from the identity 
($t = 0$) to the Bercovici-Pata bijection ($t = +\infty$). This continuum is related to the notion 
of t-freeness developed by the first named author and Lévy in [11]. The maps $\Lambda_{\Gamma_t}$ are made 
explicit (at least at the moments level) by the following proposition, whose proof, based 
on the explicitation of the functions $f_{\Gamma_t}$, is postponed to Section 10. 

Proposition 3.4. Let $\mu$ be an infinitely divisible law with moments of all orders. Then 
for each $t \ge 0$, 

(19) $$\int x^k \Lambda_{\Gamma_t}(\mu)(dx) = \sum_{\pi \in \mathrm{Part}(k)} e^{-\kappa(\pi) t} \prod_{J \in \pi} c_{|J|}(\mu) \qquad (k \ge 1)$$

with $\kappa$ defined by: 

• $\kappa(\pi) = \kappa(\pi_1) + \cdots + \kappa(\pi_q)$ if $\pi_1, \ldots, \pi_q$ are the connected components of $\pi$; 

• if $\pi$ is connected, $\kappa(\pi)$ is the number of times one changes from one block to another 
when running through $\pi$ in a cyclic way. 



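Example of computation of $\kappa(\pi)$. For instance, let us consider the partition $\pi = 
\{\{1, 8, 10\}, \{2, 4\}, \{3, 5\}, \{6, 7, 9\}\} \in \mathrm{Part}(10)$ (cf. Figure 2). Then the connected 
components of $\pi$ are the partitions $\pi_1$ and $\pi_2$ induced by $\pi$ on the sets $\{1, 6, 7, 8, 9, 10\}$ and 
$\{2, 3, 4, 5\}$. Running cyclically through $1, 6, 7, 8, 9, 10$, one changes block four times, and likewise 
through $2, 3, 4, 5$. Since $\kappa(\pi_1) = 4$ and $\kappa(\pi_2) = 4$, $\kappa(\pi) = 8$. 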
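
The moment formula (19) is easy to evaluate by brute force for small $k$. The sketch below is ours, not the authors': the function names are hypothetical, and the connected components are computed by repeatedly merging groups of blocks whose unions cross, which is one straightforward way to obtain the minimal non-crossing partition above $\pi$. It is checked on the standard Gaussian law (cumulants $c_2 = 1$, all others zero), whose fourth moment under (19) should be $2 + e^{-4t}$: the two non-crossing pairings plus the penalized crossing one.

```python
import math
from itertools import combinations

def set_partitions(elements):
    """Yield all partitions of the list `elements` as lists of blocks."""
    if not elements:
        yield []
        return
    first, rest = elements[0], elements[1:]
    for partition in set_partitions(rest):
        # put `first` in an existing block, or in a new singleton block
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        yield [[first]] + partition

def blocks_cross(a, b):
    """True if there are x < y < z < w with x, z in one set and y, w in the other."""
    labels = [lab for _, lab in sorted([(x, 0) for x in a] + [(x, 1) for x in b])]
    collapsed = [labels[0]]
    for lab in labels[1:]:
        if lab != collapsed[-1]:
            collapsed.append(lab)
    return len(collapsed) >= 4  # at least four alternations means a crossing

def connected_components(partition):
    """Group the blocks of `partition` until no two groups have crossing unions."""
    groups = [[sorted(block)] for block in partition]
    merged = True
    while merged:
        merged = False
        for g, h in combinations(groups, 2):
            if blocks_cross(sum(g, []), sum(h, [])):
                groups.remove(h)
                g.extend(h)
                merged = True
                break
    return groups

def kappa(partition):
    """Sum over connected components of the number of cyclic changes of block."""
    total = 0
    for component in connected_components(partition):
        label = {x: i for i, block in enumerate(component) for x in block}
        points = sorted(label)
        total += sum(label[points[j]] != label[points[(j + 1) % len(points)]]
                     for j in range(len(points)))
    return total

def moment(k, t, cumulant):
    """k-th moment given by (19); `cumulant(n)` is the n-th classical cumulant of mu."""
    return sum(math.exp(-kappa(p) * t) * math.prod(cumulant(len(b)) for b in p)
               for p in set_partitions(list(range(1, k + 1))))

def gaussian_cumulant(n):
    """Cumulants of the standard Gaussian law."""
    return 1.0 if n == 2 else 0.0

print(moment(4, 0.0, gaussian_cumulant), moment(4, 1.0, gaussian_cumulant))
```

Passing `lambda n: lam` as the cumulant function gives the moments (5) of the law $P_{\lambda,t}$, since all cumulants of the Poisson law with parameter $\lambda$ are equal to $\lambda$.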




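[Figure 2: diagram of the partition on the points 1, 2, ..., 10.]

Figure 2. The partition $\pi = \{\{1, 8, 10\}, \{2, 4\}, \{3, 5\}, \{6, 7, 9\}\} \in \mathrm{Part}(10)$. 

Let us now say a few words about the bijection $\Lambda_{\Gamma_t}$, as a function of $t \in [0, +\infty)$. 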
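
If $t = 0$, then: 

$$\int x^k \Lambda_{\Gamma_0}(\mu)(dx) = \sum_{\pi \in \mathrm{Part}(k)} \prod_{J \in \pi} c_{|J|}(\mu) = \int x^k \mu(dx).$$

This corroborates the fact that $\Lambda_{\Gamma_0}$ is the identity map. 
If $t$ tends to infinity, then: 

$$\int x^k \Lambda_{\Gamma_t}(\mu)(dx) \longrightarrow \sum_{\pi \in \mathrm{NC}(k)} \prod_{J \in \pi} c_{|J|}(\mu) = \int x^k \Lambda(\mu)(dx),$$

where $\mathrm{NC}(k)$ denotes the set of non-crossing partitions of $\{1, \ldots, k\}$ (see Section 5.1 for a 
precise definition). Indeed, $\kappa(\pi)$ is positive unless $\pi$ is non-crossing. Since the continuity 
of $\Lambda_\Gamma$ w.r.t. the weak topology is uniform in $\Gamma$, as it appears from the proof of Proposition 
6.5 b), this proves that $\Lambda_{\Gamma_t}$ tends to the Bercovici-Pata bijection $\Lambda$ when $t$ goes to infinity, 
as expected. 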

3.4. The distribution $\mathbb{P}^{\mu}_{U_N}$ made more explicit. When $\mu$ is a compound Poisson law, 
the definition of $\mathbb{P}^{\mu}_{U_N}$ has been made explicit in Section 2.1.1. In this section, we make 
$\mathbb{P}^{\mu}_{U_N}$ explicit for $\mu$ a Dirac mass or a Gaussian law (the column vector $U_N$ underlying the definition 
of $\mathbb{P}^{\mu}_{U_N}$ staying as general as possible). Since any infinitely divisible law is a weak limit of 
convolutions of such laws and obviously, by the Formula (7) of the Fourier transform of 
$\mathbb{P}^{\mu}_{U_N}$, the law $\mathbb{P}^{\mu}_{U_N}$ depends continuously on $\mu$ and satisfies 

$$\mathbb{P}^{\mu}_{U_N} * \mathbb{P}^{\nu}_{U_N} = \mathbb{P}^{\mu * \nu}_{U_N},$$

this gives a good idea of what a random matrix distributed according to $\mathbb{P}^{\mu}_{U_N}$ looks like. 
Moreover, we also consider the case where $\mu$ is a Cauchy law, where a surprising behaviour 
w.r.t. the convolution appears. 

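First, it can easily be seen that if $\mu$ is the Dirac mass at $\gamma \in \mathbb{R}$, then $\mathbb{P}^{\mu}_{U_N}$ is the Dirac 
mass at 

$$\gamma\, \mathbb{E}[U_N U_N^*] = \gamma \left( \mathbb{E}\left[|U_N(1)|^2\right] - \mathbb{E}\left[U_N(1) \overline{U_N(2)}\right] \right) I_N + \text{rank-one matrix}.$$

Hence, due to Hypothesis 2.4 and Lemma 12.2, $\Lambda_\Gamma(\delta_\gamma) = \delta_{\gamma'}$ for $\gamma' = \gamma \Gamma(1)$. 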

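Suppose now that $\mu$ is the standard Gaussian law $\mathcal{N}(0, 1)$ and that Hypothesis 2.4 
holds. Let $\tilde{U}_N$ be as in Remark 2.5. Then the distribution $\mathbb{P}^{\mu}_{\tilde{U}_N}$ only depends on 
$\alpha := \mathbb{E}[|U_N(1)|^2 |U_N(2)|^2]$, $\gamma := \mathbb{E}[|U_N(1)|^4]$ and $\beta := \dim_{\mathbb{R}}(\mathbb{K})$: when $\gamma \ge \alpha$, 
$\mathbb{P}^{\mu}_{\tilde{U}_N}$ is the law of 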



N \ ■ \ N \ ■ \ Nf3 
where $g_1, \ldots, g_N, g$ are standard Gaussian variables and GO(U)E designates a GOE or GUE 
matrix (according to whether $\mathbb{K} = \mathbb{R}$ or $\mathbb{C}$) as defined p. 51 of [2] (i.e. a standard Gaussian 
vector on the space of real symmetric or Hermitian matrices endowed with the scalar 
product $A \cdot B := \frac{\beta}{2} \mathrm{Tr}\, AB$). As a consequence, since when $N \to \infty$, $\alpha \sim \Gamma(1, 1)$ and 
$\gamma \sim N \Gamma(2)$, the spectral law of a $\mathbb{P}^{\mu}_{\tilde{U}_N}$-distributed matrix converges to 

$$\Lambda_\Gamma(\mu) = \mathcal{N}(0, \Gamma(2)) \boxplus (\text{Semicircle law with variance } 2\Gamma(1, 1)/\beta).$$

In the next proposition, we consider the case where $\mu$ is a Cauchy law with parameter $t$: 

$$C_t(dx) := \frac{t\, dx}{\pi (t^2 + x^2)}.$$

Let $P_t$ be the associated kernel, defined by 

$$P_t(f)(x) = \int f(x + y)\, C_t(dy).$$

Proposition 3.5. We suppose that $\mathbb{E}[U_N U_N^*] = I_N$. For $M_t$ a $\mathbb{P}^{C_t}_{U_N}$-distributed random 
matrix and $f : \mathbb{R} \to \mathbb{R}$ bounded, for any $N \times N$ Hermitian matrix $A$, 

$$\mathbb{E}[f(A + M_t)] = P_t(f)(A),$$

real functions being applied to Hermitian matrices via the functional calculus. 

The following Corollary follows easily, using the vector $V_N := \tilde{U}_N / \sqrt{\Gamma(1)}$ instead of $U_N$ 
(with $\tilde{U}_N$ as defined in Remark 2.5). 

Corollary 3.6. Under Hypothesis 2.4, the set of Cauchy laws is invariant by $\Lambda_\Gamma$: for any 
$t > 0$, $\Lambda_\Gamma(C_t) = C_{t'}$, for $t' = \Gamma(1)\, t$. 


4. Proof of Theorem 2.1 and Proposition 2.2 

One proves the theorem and the proposition at the same time, by showing that under 
the hypotheses of the theorem or of a) or b) of the proposition, the Fourier transform of the 
left hand term of (8), that we denote by $M_n$, converges pointwise to the right hand term 
of (7). Indeed, for any Hermitian matrix $A$, 

$$\mathbb{E}\left[e^{i \mathrm{Tr}(A M_n)}\right] = \left( 1 + \frac{1}{k_n} \mathbb{E}\left[\Psi_n(U_N^* A U_N / N)\right] \right)^{N k_n},$$

where for any $x \in \mathbb{R}$, $\Psi_n(x) := k_n \int (e^{itx} - 1)\, d\nu_n(t)$. The function $\Psi_n$ converges to $\Psi_\mu$ 
as $n \to \infty$, uniformly on every compact subset of $\mathbb{R}$ (this follows from [27, Lem. 3.1] 
and from the fact that $\nu_n^{*k_n} \to \mu$ uniformly on every compact set). It is enough to prove 
the result in the case where the law of $U_N$ is compactly supported. To conclude in the 
case where $\nu_n = \mu^{*1/n}$ (resp. in the case where $\int |t|\, \nu_n(dt) = O(1/k_n)$), one needs to argue 
that there is a constant $C$ such that for all $t > 0$, $|\Psi_{\mu^{*t}}(\lambda)| \le tC(|\lambda| + \lambda^2)$ (resp. that 
$|\Psi_n(x)| \le k_n |x| \int |t|\, \nu_n(dt)$). 

5. Preliminaries for the proof of Theorem 2.6: partitions and graphs 

The aim of this section is to introduce the combinatorial definitions which will be used in 
the next section in order to prove Theorem 2.6. The conventions we chose for the definitions 
of partitions, graphs, and for the less well-known notion of hypergraph are presented in 
Section 12.3 of the appendix. 

5.1. Partitions. 

• Let us recall that we denote by $\mathrm{Part}(k)$ the set of partitions of $[k] = \{1, \ldots, k\}$, 
and by $\mathrm{NC}(k)$ the set of non-crossing partitions of $\{1, \ldots, k\}$ (a partition $\pi$ of 
$\{1, \ldots, k\}$ is said to be non-crossing if there does not exist $x < y < z < t$ such that 
$x \sim z \nsim y \sim t$, where $\sim$ means belonging to the same block of $\pi$). 

• For any given partition $\pi \in \mathrm{Part}(k)$, we denote by $\mathrm{nc}(\pi)$ the minimal 
non-crossing partition which is above $\pi$ for the refinement order; the partitions induced 
by $\pi$ on the blocks of $\mathrm{nc}(\pi)$ are called the connected components of $\pi$. 

• If $\mathrm{nc}(\pi)$ has only one block, then $\pi$ is said to be connected. 

• We define $\mathrm{thin}(\pi)$ to be the partition induced by $\pi$ on the subset of $\{1, \ldots, k\}$ 
obtained by erasing $\ell$ whenever $\ell + 1 \sim \ell$ (with the convention $k + 1 = 1$). 

• A partition $\pi$ is said to be thin if $\pi = \mathrm{thin}(\pi)$. 

For instance, for $\pi = \{\{1, 8, 10\}, \{2, 4\}, \{3, 5\}, \{6, 7, 9\}\} \in \mathrm{Part}(10)$ the partition $\mathrm{nc}(\pi)$ 
is $\{\{1, 6, 7, 8, 9, 10\}, \{2, 3, 4, 5\}\}$ and Figure 3 illustrates the operation $\pi \mapsto \mathrm{thin}(\pi)$. 

For any function $f$ defined on a set $E$, we denote by $\ker f$ the partition of $E$ whose blocks 
are the level sets of $f$. For any partition $\pi \in \mathrm{Part}(k)$, for each $\ell \in \{1, \ldots, k\}$, we denote 
by $\pi(\ell)$ the index of the class of $\ell$ in $\pi$, after having ordered the classes according to the 
order of their first element (for the example given in Figure 2 we have $\pi(1) = 1$, $\pi(2) = 2$, 
$\pi(3) = 3$ and $\pi(4) = 2$, $\pi(5) = 3$, $\pi(6) = 4$, ...). Moreover, $0$ is identified with $k$ and $k + 1$ 
is identified with $1$, so that $\pi(0) = \pi(k)$ and $\pi(k + 1) = \pi(1)$. 
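
The three operations of this section (the non-crossing closure, the thinning and the class index) can be made concrete on the example partition. The following is a minimal sketch with hypothetical function names of our own; partitions are represented as lists of blocks, and $\mathrm{nc}(\pi)$ is obtained by merging crossing blocks until none remain, which is one simple way to compute it, not necessarily the authors'.

```python
from itertools import combinations

def blocks_cross(a, b):
    """True if there are x < y < z < w with x, z in one block and y, w in the other."""
    labels = [lab for _, lab in sorted([(x, 0) for x in a] + [(x, 1) for x in b])]
    collapsed = [labels[0]]
    for lab in labels[1:]:
        if lab != collapsed[-1]:
            collapsed.append(lab)
    return len(collapsed) >= 4

def nc_closure(partition):
    """Minimal non-crossing partition above `partition`: merge crossing blocks until stable."""
    blocks = [sorted(b) for b in partition]
    merged = True
    while merged:
        merged = False
        for a, b in combinations(blocks, 2):
            if blocks_cross(a, b):
                blocks.remove(a)
                blocks.remove(b)
                blocks.append(sorted(a + b))
                merged = True
                break
    return sorted(blocks)

def thin(partition, k):
    """Erase l whenever l + 1 lies in the same block as l (with k + 1 = 1)."""
    label = {x: i for i, block in enumerate(partition) for x in block}
    kept = {l for l in range(1, k + 1) if label[l % k + 1] != label[l]}
    thinned = [[x for x in sorted(block) if x in kept] for block in partition]
    return [block for block in thinned if block]

def class_indices(partition, k):
    """pi(1), ..., pi(k): index of the class of each point, classes ordered by first element."""
    ordered = sorted(partition, key=min)
    label = {x: i + 1 for i, block in enumerate(ordered) for x in block}
    return [label[l] for l in range(1, k + 1)]

pi = [[1, 8, 10], [2, 4], [3, 5], [6, 7, 9]]
print(nc_closure(pi))
print(thin(pi, 10))
print(class_indices(pi, 10))
```

Here `thin` keeps the original labels of the surviving points rather than renumbering them, which is a presentation choice of this sketch.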



5.2. Graphs. Let $G = (V, E = (e_i, i \in I))$ be a graph, and $\pi$ a partition of $V$. We also denote by $\pi$ 
the canonical surjection from $V$ onto $\pi$. The quotient graph $G/\pi$ is the graph with $\pi$ as set 
of vertices and with edges $(e_i^{\pi}, i \in I)$ such that $e_i^{\pi}$ is the edge between $\pi(v)$ and $\pi(w)$ if 
$e_i$ is an edge between $v$ and $w$, with same direction if the graph is directed. Note that the 
quotient graph can have strictly fewer vertices than the initial one, but it has as many edges 
as the initial one. Note also that the quotient of a circuit is still a circuit. 

[Figure 3: diagram, points labelled 1 2 3 4 5 6 8 9.]

Figure 3. Illustration of the operation $\pi \mapsto \mathrm{thin}(\pi)$ 

For instance, let us consider the preceding partition $\pi$ and $G$ be the cyclic graph with 
vertex set $V = \{1, \ldots, 10\}$ and edges $1 \to 2 \to \cdots \to 10 \to 1$ (cf. Figure 4(a) where we 
draw the partition $\pi$ with dashed lines). The quotient graph $G/\pi$ is then given by Figure 
4(b) with $V_1 = \{1, 8, 10\}$, $V_2 = \{2, 4\}$, $V_3 = \{3, 5\}$ and $V_4 = \{6, 7, 9\}$. 

Figure 4. A cyclic graph and one of its quotients 



Let $G = (V, E)$ be a directed graph. For any vertex $v \in V$, we denote by $E^+(v)$ the subset 
of out-going edges from $v$, and by $E^-(v)$ the subset of in-going edges to $v$: 

$$E^+(v) = \{e \in E;\ (\exists v' \in V)(e = (v, v'))\}$$
$$E^-(v) = \{e \in E;\ (\exists v' \in V)(e = (v', v))\}$$

Notice that $E^+(v)$ and $E^-(v)$ are not necessarily disjoint. 

Let us consider $\{1, \ldots, N\}^E$ as a set of "colorings" of the edges of $G$ by the 
colors $1, \ldots, N$. For any coloring $c \in \{1, \ldots, N\}^E$, let $n^+(v, i) := |E^+(v) \cap c^{-1}(\{i\})|$ and 
$n^-(v, i) := |E^-(v) \cap c^{-1}(\{i\})|$. If for each vertex $v$ and each color $i$, $n^+(v, i) = n^-(v, i)$, the 
coloring is called admissible. 

Figure 5 presents an instance of an admissible coloring of $G$ with three colors. 
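
The admissibility condition is a simple count of in-going and out-going edges per vertex and per color. The sketch below checks it on the quotient of the cyclic graph $1 \to 2 \to \cdots \to 10 \to 1$ by the partition of the running example. The function names and the particular three-color coloring are our own illustrative choices; the coloring is not claimed to be the one drawn in Figure 5.

```python
from collections import Counter

def quotient_edges(edges, block_of):
    """Edges of the quotient graph: each endpoint is replaced by the name of its block."""
    return [(block_of[v], block_of[w]) for v, w in edges]

def is_admissible(edges, coloring):
    """True if every vertex has as many out-going as in-going edges of each color."""
    out_count, in_count = Counter(), Counter()
    for (tail, head), color in zip(edges, coloring):
        out_count[(tail, color)] += 1
        in_count[(head, color)] += 1
    return out_count == in_count

blocks = {"V1": [1, 8, 10], "V2": [2, 4], "V3": [3, 5], "V4": [6, 7, 9]}
block_of = {x: name for name, block in blocks.items() for x in block}
cycle = [(i, i % 10 + 1) for i in range(1, 11)]  # edges 1->2, ..., 9->10, 10->1
quotient = quotient_edges(cycle, block_of)

# One color per edge of the cycle, listed in the order 1->2, 2->3, ..., 10->1.
coloring = [3, 2, 2, 3, 3, 3, 3, 3, 3, 1]
broken = [1] + coloring[1:]  # recoloring the first edge breaks the balance at V1 and V2
print(is_admissible(quotient, coloring), is_admissible(quotient, broken))
```

The coloring with a single color is always admissible here, since the quotient of a circuit is a circuit and every vertex then has equal in-degree and out-degree.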

A partition r of the edges of G will be said admissible if it is the kernel of an admissible 
coloring. In this case, for any vertex G G, we define 

(20) ririv) := {nr{v,l),n,{v,2), ) 

to be the common value of the decreasing reordering of the families {n'^{v, 1), n^(f , 2), 

for c coloring such that kerc = r. 
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V3 



Figure 5. An admissible coloring 
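As a hedged illustration of the definitions of admissible colorings and of the sequence (20), the following Python sketch tests admissibility and computes the decreasing sequence of color counts at a vertex. The function names and the small example graph (a 4-cycle whose vertices 2 and 4 have been identified into a block `"B"`) are assumptions made for illustration; they are not the coloring drawn in Figure 5.

```python
from collections import Counter


def color_counts(edges, coloring):
    """Count, per (vertex, color), the out-going and the in-going edges."""
    out_counts, in_counts = Counter(), Counter()
    for (tail, head), color in zip(edges, coloring):
        out_counts[(tail, color)] += 1
        in_counts[(head, color)] += 1
    return out_counts, in_counts


def is_admissible(edges, coloring):
    """A coloring is admissible when both counts agree at every (vertex, color)."""
    out_counts, in_counts = color_counts(edges, coloring)
    return out_counts == in_counts


def n_tau(edges, coloring, vertex):
    """Decreasing reordering of the out-going color counts at `vertex`."""
    out_counts, _ = color_counts(edges, coloring)
    counts = [n for (v, _), n in out_counts.items() if v == vertex]
    return tuple(sorted(counts, reverse=True))


# Quotient of the 4-cycle 1 -> 2 -> 3 -> 4 -> 1 by {{1}, {2, 4}, {3}},
# with blocks labelled A = {1}, B = {2, 4}, C = {3} (hypothetical labels).
edges_q = [("A", "B"), ("B", "C"), ("C", "B"), ("B", "A")]
```

With these conventions, the coloring `[1, 2, 2, 1]` is admissible and gives the sequence `(1, 1)` at the block `"B"`, whereas `[1, 2, 2, 2]` is not admissible.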

5.3. Hypergraph associated to a partition of the edges of a graph. Let $G$ be a
graph with vertex set $V$, $\pi$ be a partition of $V$ and $\tau$ be a partition of the edge set of $G$.
Then one can define $H(\pi, \tau)$ to be the hypergraph with the same vertex set as $G/\pi$ (i.e. $\pi$)
and with edges $(E_W, W \in \tau)$, where each edge $E_W$ is the set of blocks $J \in \pi$ such that at
least one edge of $G/\pi$ starting or ending at $J$ belongs to $W$.

For instance, with $\pi$, $G/\pi$ and $\tau$ as given by the preceding example of Figure 5, $H(\pi, \tau)$ is
the hypergraph with three edges drawn in Figure 6. Another example, where $H(\pi,\tau)$ has
no cycle, is given in Figure 7.




Figure 6. The graph $H(\pi,\tau)$ with $\pi$, $G/\pi$ and $\tau$ as given by the preceding example
of Figure 5

The next proposition will be used in the following. For $\tau$ a partition of the edges of a
graph, a cycle of the graph is said to be $\tau$-monochromatic if all the edges it visits belong
to the same block of $\tau$.

Proposition 5.1. Let $G = (V, E)$ be a directed graph, which is a circuit. Fix $\pi \in \mathrm{Part}(V)$
and let $\tau$ be a partition of the edge set of $G$ such that $H(\pi,\tau)$ has no cycle. Then $G/\pi$ is a
disjoint union of $\tau$-monochromatic cycles. As a consequence, $\tau$ is admissible.

Proof. If $\tau$ has more than one block (the case with one block being obvious), since $G/\pi$ is
a circuit, one can find a closed path

$$\gamma = (v_0, e_1, v_1, e_2, \ldots, e_k, v_k) \quad (\text{with } v_k = v_0)$$

of $G/\pi$ which visits each edge of $G/\pi$ exactly once and such that $e_1$ and $e_k$ do not belong to
the same block of $\tau$. For each edge $e$ of $G/\pi$, let $E(e)$ be the edge of $H(\pi, \tau)$ consisting of
all vertices of $G/\pi$ which are the beginning or the end of an edge of $G/\pi$ with the same color
as $e$ (i.e. in the same block of $\tau$ as $e$). Then $(v_0, E(e_1), v_1, E(e_2), \ldots, E(e_k), v_k)$ is a path of
$H(\pi, \tau)$, i.e. for all $\ell = 1, \ldots, k$, $v_{\ell-1}, v_\ell \in E(e_\ell)$. Let us introduce $i_0 = 0 < i_1 < \cdots < i_p = k$
such that $i_1, \ldots, i_p$ are the instants where the color of the edges changes in $\gamma$ (i.e. for all
$s = 1, \ldots, p$, $e_{i_{s-1}+1}, \ldots, e_{i_s}$ have the same color, and for all $s = 1, \ldots, p-1$, $e_{i_s}$ and $e_{i_s+1}$
do not have the same color). Then

$$\tilde\gamma := (v_{i_0}, E(e_{i_1}), v_{i_1}, \ldots, v_{i_{p-1}}, E(e_{i_p}), v_{i_p})$$

is also a path of $H(\pi, \tau)$.

Since $H(\pi, \tau)$ has no cycle, hence is linear, $\{v_{i_r}\} = E(e_{i_r}) \cap E(e_{i_{r+1}})$ for $r = 1, \ldots, p-1$
and $\{v_{i_0}\} = \{v_{i_p}\} = E(e_{i_p}) \cap E(e_{i_1})$. Let $v_{i_r}, v_{i_{r+1}}, \ldots, v_{i_s}$ be a sequence such that
$v_{i_r}, \ldots, v_{i_{s-1}}$ are pairwise distinct vertices, with $v_{i_r} = v_{i_s}$ and $r < s$ (this exists since $v_{i_0} = v_{i_p}$).
Then $s = r + 1$, otherwise $(v_{i_r}, E(e_{i_{r+1}}), v_{i_{r+1}}, \ldots, E(e_{i_s}), v_{i_s})$ would be a cycle of $H(\pi, \tau)$.
Let us consider the path $(v_{i_r}, e_{i_r+1}, v_{i_r+1}, \ldots, e_{i_{r+1}}, v_{i_{r+1}})$. Its edges have the same color,
and this is a disjoint union of $\tau$-monochromatic cycles.

Let us now apply the same trick to the reduced path

$$(v_{i_0}, E(e_{i_1}), v_{i_1}, \ldots, v_{i_r}, E(e_{i_{r+2}}), v_{i_{r+2}}, \ldots, E(e_{i_p}), v_{i_p}).$$

This allows us to achieve by induction the proof of the proposition. □

□ 



An example of $\tau$ such that $H(\pi, \tau)$ has no cycle is given in Figure 7 below (another
example of hypergraph with no cycle is given in Figure 10(b) of the appendix).

Figure 7. On the left: an admissible partition $\tau$ of the edges of $G/\pi$ whose
corresponding hypergraph $H(\pi, \tau)$ (on the right) has no cycle.

6. Proof of Theorem 2.6

The proof of Theorem 2.6 will follow from Propositions 6.1 and 6.5 and Lemma 6.7
below. We suppose that Hypothesis 2.4 holds.



6.1. Convergence of the mean spectral distribution in the case with moments. In
this section, we prove the following proposition. The parametrization of infinitely divisible
laws via the formula $\mu = \nu_*^{\gamma,\sigma}$ is introduced in Section 12.2 of the appendix.



Proposition 6.1. Consider an infinitely divisible law $\mu = \nu_*^{\gamma,\sigma}$ such that $\sigma$ has compact
support. Define the mean spectral distribution $\Lambda_N^{\mu}$ of a $\mathbb{P}_N^{\mu}$-distributed random matrix $M$
by

$$\int g(t)\, \Lambda_N^{\mu}(dt) = \mathbb{E}\Big[\frac{1}{N} \operatorname{Tr} g(M)\Big] \quad \text{(for any Borel function } g\text{)}.$$

Then $\Lambda_N^{\mu}$ converges weakly, as $N \to \infty$, to a probability measure $\Lambda_\Gamma(\mu)$ which depends
only on $\mu$ and on the function $\Gamma$ introduced in Hypothesis 2.4. This distribution admits
exponential moments of any order, hence is characterized by its moments, given by Formula
(12) in Theorem 2.6.

Let us introduce a $\mathbb{P}_N^{\mu}$-distributed random matrix $M$ and fix $k \ge 1$. Proposition 6.1
follows directly from Lemmas 6.2, 6.3 and 6.4 below.

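Lemma 6.2. For any $N \ge 1$ and $k \ge 1$, there is a collection $(f_N(\pi))_{\pi \in \mathrm{Part}(k)}$ of numbers
depending only on the distribution of $U_N$ such that

$$\mathbb{E}\Big[\frac{1}{N} \operatorname{Tr} M^k\Big] = \sum_{\pi \in \mathrm{Part}(k)} f_N(\pi)\, c_\pi(\mu).$$

Moreover, for any $\pi$, $f_N(\pi)$ is given by the formula

(21) $$f_N(\pi) = N^{|\pi|-1-k}\, \mathbb{E}\Big[\operatorname{Tr} \prod_{\ell=1}^{k} U_N^{\pi(\ell)} U_N^{\pi(\ell)*}\Big],$$

where $(U_N^{r})_{r \ge 1}$ is a sequence of i.i.d. copies of $U_N$.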



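Proof. For each $n \ge 1$, set $\nu_n = \mu^{*1/n}$ and let $X_n^1, X_n^2, \ldots$ be a sequence of i.i.d.
$\nu_n$-distributed random variables, independent of the $U_N^i$'s, so that according to Theorem 2.1,
for

$$M_n := \frac{1}{N} \sum_{i=1}^{N \times n} X_n^i\, U_N^i U_N^{i*},$$

the law of $M_n$ converges weakly to $\mathbb{P}_N^{\mu}$. Then it is immediate that for $n$ large enough,

$$\mathbb{E}\Big[\frac{1}{N} \operatorname{Tr} M_n^k\Big] = \frac{1}{N^{k+1}} \sum_{\pi \in \mathrm{Part}(k)} \frac{(Nn)!}{(Nn - |\pi|)!}\, m_\pi(\nu_n)\, \mathbb{E}\Big[\operatorname{Tr} \prod_{\ell=1}^{k} U_N^{\pi(\ell)} U_N^{\pi(\ell)*}\Big],$$

where for any $\pi$,

$$m_\pi(\nu_n) := \prod_{J \in \pi} \int t^{|J|}\, \nu_n(dt).$$

Note that since the $U_N^i$'s are column vectors, $\operatorname{Tr} \prod_{\ell=1}^{k} U_N^{\pi(\ell)} U_N^{\pi(\ell)*} = \prod_{\ell=1}^{k} U_N^{\pi(\ell)*} U_N^{\pi(\ell+1)}$.
Moreover, by [01 Th. 1.6], $n^{|\pi|} m_\pi(\nu_n) \to c_\pi(\mu)$, which allows to conclude. □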

Let us now consider the large $N$ limit of $f_N(\pi)$. Let $G$ be the cyclic graph with vertex
set $V = \{1, \ldots, k\}$ and edges $1 \to 2 \to \cdots \to k \to 1$. The hypergraph $H(\pi, \tau)$ is defined
in Section 5.3 and the operator thin on partitions is defined in Section 5.1.

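Lemma 6.3. For each $\pi \in \mathrm{Part}(k)$, there is $f_\Gamma(\pi) \ge 0$ such that $f_N(\pi) \to f_\Gamma(\pi)$ as
$N \to \infty$. The number $f_\Gamma(\pi)$ factorizes along the connected components of $\pi$ and is given by:

(22) $$f_\Gamma(\pi) = \sum_{\substack{\tau \text{ partition of the edges of } G/\pi \\ \text{s.t. } H(\pi,\tau) \text{ has no cycle}}} \ \prod_{J \in \pi} \Gamma(n_\tau(J)).$$

If, moreover, $U_N$ is a unitary vector, for any $\pi$, $f_\Gamma(\pi) = f_\Gamma(\mathrm{thin}(\pi))$ and for $\pi$ non crossing
$f_\Gamma(\pi) = 1$.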



Before proving the lemma, let us give two examples to make the set of partitions $\tau$ such
that $H(\pi, \tau)$ has no cycle a bit more intuitive.

a) Consider for example the case $k = 4$ and $\pi = \{\{1\}, \{2\}, \{3\}, \{4\}\}$ (trivial partition).
Then it can easily be seen that there is only one such $\tau$: it is the trivial partition with only
one block.

b) Consider now the partition $\pi = \{\{1\}, \{2, 4\}, \{3\}\}$. Then there are two such $\tau$'s:
the first one is the one with only one block, containing all edges; the second one is
$\{\{(1,2),(4,1)\},\{(2,3),(3,4)\}\}$.
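Both examples can be recovered by brute force. The sketch below is only an illustration under stated assumptions: it enumerates all partitions $\tau$ of the $k$ edges of $G/\pi$ and keeps those for which $|\pi| + |\tau| - 1 - \sum_{W \in \tau} |E_W| = 0$, which is the criterion for $H(\pi, \tau)$ to have no cycle that is used in the proof of Lemma 6.3 below (via Property 12.5). The function names are ours, and edges are indexed from 0, edge number $\ell - 1$ being the edge from $\ell$ to $\ell + 1$.

```python
def set_partitions(items):
    """Yield every partition of the list `items` as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for smaller in set_partitions(rest):
        # Put `first` into one of the existing blocks...
        for i in range(len(smaller)):
            yield smaller[:i] + [[first] + smaller[i]] + smaller[i + 1:]
        # ...or into a new block of its own.
        yield [[first]] + smaller


def acyclic_edge_partitions(k, pi):
    """Partitions tau of the edges of G/pi such that H(pi, tau) has no cycle.

    G is the cyclic graph 1 -> 2 -> ... -> k -> 1 and `pi` is a list of
    blocks (sets) partitioning {1, ..., k}.
    """
    block_of = {v: j for j, block in enumerate(pi) for v in block}
    # Edge number l - 1 of G/pi goes from the block of l to the block of l + 1.
    edges = [(block_of[l], block_of[l % k + 1]) for l in range(1, k + 1)]
    result = []
    for tau in set_partitions(list(range(k))):
        # Hyperedge E_W: blocks of pi touched by at least one edge of W.
        hyperedges = [{x for e in W for x in edges[e]} for W in tau]
        if len(pi) + len(tau) - 1 - sum(len(E) for E in hyperedges) == 0:
            result.append(tau)
    return result


example_a = acyclic_edge_partitions(4, [{1}, {2}, {3}, {4}])
example_b = acyclic_edge_partitions(4, [{1}, {2, 4}, {3}])
```

For example a) the search returns only the one-block partition, and for example b) it returns the one-block partition together with the partition whose blocks are the edge indices $\{0, 3\}$ and $\{1, 2\}$, i.e. $\{\{(1,2),(4,1)\},\{(2,3),(3,4)\}\}$.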



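Proof. • Let us first prove that $f_N(\pi)$ has a limit $f_\Gamma(\pi)$ given by Formula (22) as $N \to \infty$.
We start with Formula (21), expand the matrix product, and use the independence of the
$U_N^{r}$'s and the exchangeability of the entries $U_N(1), \ldots, U_N(N)$ of $U_N$:

$$
\begin{aligned}
f_N(\pi) &= N^{|\pi|-1-k}\, \mathbb{E}\Big[\prod_{\ell=1}^{k} U_N^{\pi(\ell)*} U_N^{\pi(\ell+1)}\Big] \\
&= N^{|\pi|-1-k} \sum_{1 \le i_1, \ldots, i_k \le N} \mathbb{E}\Big[\prod_{\ell=1}^{k} \overline{U_N^{\pi(\ell)}(i_\ell)}\, U_N^{\pi(\ell+1)}(i_\ell)\Big] \\
&= N^{|\pi|-1-k} \sum_{1 \le i_1, \ldots, i_k \le N} \ \prod_{J \in \pi} \mathbb{E}\Big[\prod_{\ell \in J} \overline{U_N(i_\ell)} \prod_{\ell' \text{ s.t. } \ell'+1 \in J} U_N(i_{\ell'})\Big] \\
&= N^{|\pi|-1-k} \sum_{\substack{\tau \text{ partition} \\ \text{of the edges of } G/\pi}} N^{\underline{|\tau|}} \prod_{J \in \pi} \mathbb{E}\Big[\prod_{\ell \in J} \overline{U_N(i^\tau_\ell)} \prod_{\ell' \text{ s.t. } \ell'+1 \in J} U_N(i^\tau_{\ell'})\Big],
\end{aligned}
$$

where for all $\tau$, $N^{\underline{|\tau|}} = N(N-1) \cdots (N - |\tau| + 1)$ and $i^\tau$ is a coloring of the edges of $G/\pi$
with kernel $\tau$.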
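The combinatorial factor $N(N-1) \cdots (N - |\tau| + 1)$ is the number of colorings of the edges by the colors $1, \ldots, N$ whose kernel is exactly $\tau$. The following brute-force Python check is a small sketch of this counting fact; the function names are ours and colors are numbered from 0 for convenience.

```python
from itertools import product


def kernel(coloring):
    """Partition of the edge indices into the level sets of the coloring."""
    blocks = {}
    for edge, color in enumerate(coloring):
        blocks.setdefault(color, set()).add(edge)
    return frozenset(frozenset(b) for b in blocks.values())


def count_colorings_with_kernel(n_colors, tau, n_edges):
    """Number of colorings of `n_edges` edges by `n_colors` colors with kernel `tau`."""
    target = frozenset(frozenset(b) for b in tau)
    return sum(1 for c in product(range(n_colors), repeat=n_edges) if kernel(c) == target)


def falling_factorial(n, r):
    """n (n - 1) ... (n - r + 1)."""
    result = 1
    for j in range(r):
        result *= n - j
    return result
```

For instance, with 5 colors and 4 edges, the partition with blocks $\{0, 3\}$ and $\{1, 2\}$ is the kernel of $5 \times 4 = 20$ colorings.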

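Now, for each $\tau$, for each $J$, let $a_\tau(J)$ be the number of blocks of $\tau$ containing an
edge having $J$ as an extremity. By Formula (9) of Hypothesis 2.4, as $N \to \infty$, the term
associated to $\tau$ is

$$O\Big(N^{|\pi| + |\tau| - 1 - \sum_{J \in \pi} a_\tau(J)}\Big)$$

and in the particular case where $\tau$ is admissible, this term is equivalent to

$$N^{|\pi| + |\tau| - 1 - \sum_{J \in \pi} a_\tau(J)} \prod_{J \in \pi} \Gamma(n_\tau(J)).$$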



Let us consider the hypergraph $H(\pi, \tau)$ defined at Section 5.3 with edges $(E_W, W \in \tau)$.
Notice that each edge of $G/\pi$ is included in one edge of $H(\pi, \tau)$. This implies that $H(\pi, \tau)$
is connected. Notice also that:

$$\sum_{J \in \pi} a_\tau(J) = \sum_{J \in \pi} \sum_{W \in \tau} \mathbb{1}_{J \in E_W} = \sum_{W \in \tau} |E_W|.$$

Hence, using Property 12.5, we know that

$$|\pi| + |\tau| - 1 - \sum_{J \in \pi} a_\tau(J) = |\pi| + |\tau| - 1 - \sum_{W \in \tau} |E_W|$$

is non positive, and vanishes if and only if $H(\pi, \tau)$ has no cycle. By Proposition 5.1, this
proves that $f_N(\pi)$ converges when $N$ goes to infinity, with a limit given by Formula (22).



• Let us now prove that $f_\Gamma(\pi)$ factorizes along the connected components of $\pi$.

Suppose that $\pi$ has at least two connected components, and let $G$ be the cyclic graph
on $\{1, \ldots, k\}$ introduced before the statement of the lemma and $G/\pi = (\pi, E)$ be the
associated quotient graph. There exists at least one connected component $\pi|_A$ of $\pi$ whose
support $A$ is an interval $[\ell, \ell']$. The corresponding subgraph of $G/\pi$ is connected to the rest
of the graph through two directed edges, $e = (V_1, V_2)$ and $e' = (V_3, V_4)$ with $\ell - 1 \in V_1$,
$\ell \in V_2$, $\ell' \in V_3$ and $\ell' + 1 \in V_4$ (with the usual identifications $0 = k$ and $1 = k + 1$).

For instance, if we still consider the partition $\pi = \{\{1, 8, 10\}, \{2, 4\}, \{3, 5\}, \{6, 7, 9\}\} \in \mathrm{Part}(10)$,
then the restriction of $\pi$ to the interval $\{2,3,4,5\}$ is a connected component of
$\pi$, and the corresponding subgraph of $G/\pi$ (cf. Figure 5) is its restriction to the set
of vertices $\{V_2, V_3\}$, and is connected to the rest of the graph by the edges $e = (V_1, V_2)$
and $e' = (V_3, V_4)$:

Figure 8. The quotient graph $G/\pi$, with the edges $e$ and $e'$

Let us define the graph $T(G/\pi)$ obtained from $G/\pi$ by replacing the edges $e$ and $e'$ by
the edges $e_0 = (V_3, V_2)$ and $e_1 = (V_1, V_4)$. This graph has two connected components, one
on the subset of vertices $\pi|_A$, the other on the complementary $\pi|_{A^c}$. The first (resp. second)
one, that we denote by $G_1$ (resp. $G_0$), is the restriction of $G/\pi$ to $A$ (resp. $A^c$), plus the
edge $e_1$ (resp. $e_0$).

For instance, $T(G/\pi)$ is drawn for the preceding example in Figure 9(a).

Let us consider a partition $\tau$ of the edges of $G/\pi$ such that $H(\pi, \tau)$ has no cycle. Since,
by Proposition 5.1, $G/\pi$ is a union of pairwise disjoint $\tau$-monochromatic cycles, $e$ and $e'$
belong to the same block of $\tau$. Let us define the partition $T(\tau)$ of the edges of $T(G/\pi)$ deduced from
$\tau$ by replacing $e$ and $e'$ by $e_0$ and $e_1$ in their block of $\tau$ (see Figure 9(b)). It is easy to see
that $T$ is a bijection between the set of partitions $\tau$ of the edges of $G/\pi$ such that $H(\pi, \tau)$
has no cycle and the set of pairs of such partitions of the two connected components of
$T(G/\pi)$. Moreover, $n_\tau(J) = n_{T(\tau)}(J)$ for any $J \in \pi$. The conclusion follows directly.

• Under the additional hypothesis that $U_N$ is a unitary vector, it follows directly from
Formula (21) that $f_N(\pi) = f_N(\mathrm{thin}(\pi))$, hence $f_\Gamma(\pi) = f_\Gamma(\mathrm{thin}(\pi))$. It also follows from
Formula (21) that for $k = 1$, $f_N(\{\{1\}\}) = 1$. If $\pi$ is non crossing, then the partitions
induced by $\pi$ on its connected components are all one-block partitions. Hence the fact
that $f_\Gamma$ factorizes along the connected components and the identities $f_\Gamma(\pi) = f_\Gamma(\mathrm{thin}(\pi))$ and
$f_\Gamma(\{\{1\}\}) = 1$, iterated, imply that $f_\Gamma(\pi) = 1$. □

Figure 9. (a) The graph $T(G/\pi)$. (b) The coloring $T(\tau)$.

Lemma 6.4. Let us define $m_0 = 1$ and, for each $k \ge 1$,

$$m_k = \sum_{\pi \in \mathrm{Part}(k)} f_\Gamma(\pi)\, c_\pi(\mu).$$

Then the radius of convergence of the series $\sum_{k \ge 0} \frac{m_k}{k!} z^k$ is infinite.

Proof. Claim: it suffices to prove the result under the additional hypothesis that $\Gamma(\cdot) = 1$.

Let us prove the claim. Recall that $\gamma, \sigma$ are such that $\mu = \nu_*^{\gamma,\sigma}$. Let $\tilde\sigma$ be the push-forward
of the measure $\sigma$ by the map $t \mapsto |t|$ and define $\tilde\mu := \nu_*^{|\gamma|,\tilde\sigma}$. Note that by Formula
(50) of the appendix, for any $k$, $|c_k(\mu)| \le c_k(\tilde\mu)$. Let us define the function $\tilde\Gamma(\cdot)$ by
$\tilde\Gamma(n_1, \ldots, n_k) := 1$ for any integers $n_1, \ldots, n_k$. For $C$ the constant of Equation (11), we
have, for any $k \ge 1$ and any $\pi \in \mathrm{Part}(k)$,

$$0 \le f_\Gamma(\pi) \le C^k f_{\tilde\Gamma}(\pi).$$

As a consequence, if one defines, for each $k \ge 0$,

$$\tilde m_k := \sum_{\pi \in \mathrm{Part}(k)} f_{\tilde\Gamma}(\pi)\, c_\pi(\tilde\mu),$$

we have $|m_k| \le C^k \tilde m_k$, and it suffices to prove the lemma for $\tilde m_k$ instead of $m_k$. We have
proved the claim.

So from now on, we suppose that $\Gamma(\cdot) = 1$. It allows us to choose a particular model for
the random column vector $U_N$: we choose $U_N$ such that its entries $U_N(1), \ldots, U_N(N)$ are
i.i.d. random variables whose law is the one of $\sqrt{N} B e^{i\theta}$, where $B$ has law $(1 - 1/N)\delta_0 + (1/N)\delta_1$,
and is independent of $\theta$, which has uniform law on $[0, 2\pi]$. This vector obviously
satisfies Hypothesis 2.4 with $\Gamma(\cdot) = 1$.

Let us define the function $g_\theta(A) := \frac{1}{N} \operatorname{Tr} e^{\theta A}$ on Hermitian matrices. To prove the lemma,
it suffices to prove that for any $\theta \in \mathbb{R}$, the sequence $\mathbb{P}_N^{\mu}(g_\theta)$ stays bounded as $N \to \infty$,
which, by definition of the law $\mathbb{P}_N^{\mu}$, will be implied by the fact that

$$\limsup_{N \to \infty}\, \limsup_{n \to \infty}\, \frac{1}{N}\, \mathbb{E}\Big[\operatorname{Tr} e^{\frac{\theta}{N} \sum_{i=1}^{Nn} X_n^i U_N^i U_N^{i*}}\Big] < \infty,$$

where for each $n$, the $X_n^i$'s are i.i.d. with law $\mu^{*1/n}$, independent of the $U_N^i$'s, some i.i.d.
copies of $U_N$.

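The Golden-Thompson inequality (cf. [30, Sect. 3.2]) states that for $A_1, A_2$ Hermitian
matrices, $\operatorname{Tr} e^{A_1 + A_2} \le \operatorname{Tr} e^{A_1} e^{A_2}$. In the case where $A_1, \ldots, A_k$ are random, independent,
such that for all $i$, $\mathbb{E}[e^{A_i}] = m_i I_N$ with $m_i > 0$, we can deduce, by induction on $k$, that

(23) $$\frac{1}{N}\, \mathbb{E}\big[\operatorname{Tr} e^{A_1 + \cdots + A_k}\big] \le m_1 \cdots m_k.$$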

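Let us now compute $\mathbb{E}[e^{\frac{\theta}{N} X_n U_N U_N^*}]$ for $X_n$ a $\mu^{*1/n}$-distributed variable independent of $U_N$.
Note first that since the Lévy measure $\sigma$ of $\mu$ has compact support, by for instance [25,
Lem. 25.6], for all $n$, the Laplace transform of $\mu^{*1/n}$ is defined as an entire function on $\mathbb{C}$
and given by the function $e^{\varphi(\cdot)/n}$, where

$$\varphi(\lambda) = \lambda\gamma + \int \Big(e^{\lambda t} - 1 - \frac{\lambda t}{1+t^2}\Big) \frac{1+t^2}{t^2}\, \sigma(dt)$$

(the integrand being understood as $\lambda^2/2$ for $t = 0$).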



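Since $U_N$ is a column vector, we have

(24) $$e^{\frac{\theta}{N} X_n U_N U_N^*} = I_N + \frac{e^{\frac{\theta}{N} X_n \|U_N\|^2} - 1}{\|U_N\|^2}\, U_N U_N^*,$$

with the convention $(e^t - 1)/t = 1$ for $t = 0$. For any diagonal matrix $D$ with diagonal
entries on the unit circle, $DU_N$ has the same law as $U_N$, hence the expectation of the
right-hand term of (24) is a diagonal matrix. Moreover, by exchangeability, for any $f$,

$$\mathbb{E}\big[f(\|U_N\|^2)\, |U_N(k)|^2\big] = \frac{1}{N}\, \mathbb{E}\big[f(\|U_N\|^2)\, \|U_N\|^2\big] \quad (1 \le k \le N),$$

so

$$\mathbb{E}\big[e^{\frac{\theta}{N} X_n U_N U_N^*}\big] = I_N + \frac{1}{N}\, \mathbb{E}\big[e^{\frac{\theta}{N} X_n \|U_N\|^2} - 1\big] I_N = \Big(1 + \frac{1}{N}\, \mathbb{E}\big[e^{\varphi(\theta \|U_N\|^2 / N)/n} - 1\big]\Big) I_N.$$

Hence by (23), we have

$$\frac{1}{N}\, \mathbb{E} \operatorname{Tr} e^{\frac{\theta}{N} \sum_{i=1}^{Nn} X_n^i U_N^i U_N^{i*}} \le \Big(1 + \frac{1}{N}\, \mathbb{E}\big[e^{\varphi(\theta \|U_N\|^2 / N)/n} - 1\big]\Big)^{Nn}.$$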
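The rank-one exponential identity (24) used above is easy to test numerically. The sketch below is an assumption-laden illustration: it works with a real vector `u` and a real scalar `c` (playing the role of $\frac{\theta}{N} X_n$), uses plain Python lists instead of a linear algebra library, and compares a truncated Taylor series of $e^{c\,uu^T}$ with the closed form $I + \frac{e^{c\|u\|^2} - 1}{\|u\|^2}\, uu^T$. All function names are ours.

```python
import math


def outer(u):
    """Rank-one matrix u u^T for a real vector u."""
    return [[a * b for b in u] for a in u]


def matmul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def expm_series(A, terms=40):
    """Matrix exponential of A by a truncated Taylor series (small matrices only)."""
    n = len(A)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for m in range(1, terms):
        term = [[x / m for x in row] for row in matmul(term, A)]  # A^m / m!
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    return result


def expm_rank_one(c, u):
    """Closed form of exp(c * u u^T): identity plus a multiple of u u^T."""
    norm2 = sum(a * a for a in u)
    # For u = 0 the matrix u u^T vanishes, so the value of the factor is irrelevant.
    factor = c if norm2 == 0 else (math.exp(c * norm2) - 1.0) / norm2
    P = outer(u)
    n = len(u)
    return [[float(i == j) + factor * P[i][j] for j in range(n)] for i in range(n)]


u = [1.0, -2.0, 0.5]
c = 0.3
series = expm_series([[c * x for x in row] for row in outer(u)])
closed = expm_rank_one(c, u)
max_gap = max(abs(series[i][j] - closed[i][j]) for i in range(3) for j in range(3))
```

The identity simply reflects the fact that $(uu^T)^m = \|u\|^{2(m-1)}\, uu^T$ for $m \ge 1$, so that the exponential series can be summed explicitly.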

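As $n \to \infty$ with $N$ fixed, the function $n(e^{\varphi(\cdot)/n} - 1)$ converges pointwise to $\varphi(\cdot)$. Moreover, the
random variable $\|U_N\|$ takes a finite set of values, hence $n\, \mathbb{E}[e^{\varphi(\theta \|U_N\|^2/N)/n} - 1] \to \mathbb{E}[\varphi(\theta \|U_N\|^2/N)]$,
so that

$$\limsup_{n \to \infty}\, \frac{1}{N}\, \mathbb{E} \operatorname{Tr} e^{\frac{\theta}{N} \sum_{i=1}^{Nn} X_n^i U_N^i U_N^{i*}} \le e^{\mathbb{E}[\varphi(\theta \|U_N\|^2 / N)]}.$$

To conclude, it suffices to verify that $\mathbb{E}[\varphi(\theta \|U_N\|^2/N)]$ stays bounded as $N \to \infty$. It follows
from the fact that the law of $\|U_N\|^2/N$ is the one of $\sum_{i=1}^{N} B_i$ where the $B_i$'s are i.i.d. with
law $(1 - 1/N)\delta_0 + (1/N)\delta_1$. □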
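To see why sums of $N$ i.i.d. Bernoulli variables with parameter $1/N$ have exponential moments bounded uniformly in $N$, one can compare them with a Poisson variable of parameter 1. The short Python sketch below only illustrates this elementary comparison (it does not replace the argument above, where the function applied to the sum is $\varphi$ rather than a pure exponential); the function names are ours.

```python
import math


def exp_moment_of_bernoulli_sum(s, n):
    """E[exp(s * S_n)] for S_n a sum of n i.i.d. Bernoulli(1/n) variables."""
    return (1.0 + (math.exp(s) - 1.0) / n) ** n


def poisson_bound(s):
    """exp(e^s - 1): the same exponential moment for a Poisson(1) variable."""
    return math.exp(math.exp(s) - 1.0)


# The Bernoulli-sum moment never exceeds the Poisson one, and approaches it.
gaps = {n: poisson_bound(1.0) - exp_moment_of_bernoulli_sum(1.0, n) for n in (1, 10, 1000, 10**6)}
```

The bound follows from $1 + x \le e^x$ applied with $x = (e^s - 1)/N$, and the gap shrinks as $N$ grows.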



6.2. Convergence of the mean spectral distribution: extension to the case without
moments and continuity of the limit.



Proposition 6.5. a) For any $*$-infinitely divisible law $\mu$, the mean spectral distribution
$\Lambda_N^{\mu}$ of a $\mathbb{P}_N^{\mu}$-distributed random matrix $M$, defined by

$$\int g(t)\, \Lambda_N^{\mu}(dt) = \mathbb{E}\Big[\frac{1}{N} \operatorname{Tr} g(M)\Big] \quad \text{(for any function } g\text{)},$$

converges weakly, as $N \to \infty$, to a probability measure $\Lambda_\Gamma(\mu)$ which depends only on $\mu$
and on the function $\Gamma$ introduced at Equation (10).

b) The limit measure $\Lambda_\Gamma(\mu)$ depends continuously on the pair $(\mu, \Gamma)$ for the weak topology.



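Proof. a) Let $(\gamma, \sigma)$ be the Lévy pair of $\mu$. Note that if $\sigma$ is compactly supported, the
result has been established in Proposition 6.1, so we shall prove the result thanks to Lemma
12.2.

Let $\varepsilon > 0$ and $M > 1$ be such that $\lambda_\infty := \sigma(\mathbb{R} \setminus [-M, M]) < \varepsilon$. Let $\sigma_\varepsilon$ (resp. $\sigma_\infty$) be the
restriction of $\sigma$ to $[-M, M]$ (resp. $\mathbb{R} \setminus [-M, M]$). Let $\mu_\infty$ be the compound Poisson law
with Fourier transform:

$$\forall \theta \in \mathbb{R}, \quad \int e^{i\theta t}\, d\mu_\infty(t) = \exp\Big(\int \big(e^{i\theta x} - 1\big) \frac{1+x^2}{x^2}\, \sigma_\infty(dx)\Big).$$

The Lévy pair of $\mu_\infty$ is $(\gamma_\infty, \sigma_\infty)$ for some $\gamma_\infty \in \mathbb{R}$. Let $\gamma_\varepsilon = \gamma - \gamma_\infty \in \mathbb{R}$, and let $\mu_\varepsilon$ be the
infinitely divisible law with Lévy pair $(\gamma_\varepsilon, \sigma_\varepsilon)$. The law $\mu_\infty$ is a compound Poisson law
whose underlying Poisson variable has a parameter less than $2\lambda_\infty$, so by construction of
the law $\mathbb{P}_N^{\mu}$ for $\mu$ a compound Poisson law given at Section 2.1, the expectation of the rank
of a $\mathbb{P}_N^{\mu_\infty}$-distributed matrix is not larger than $2\varepsilon N$.

Since $\mu = \mu_\varepsilon * \mu_\infty$, we have $\mathbb{P}_N^{\mu} = \mathbb{P}_N^{\mu_\varepsilon} * \mathbb{P}_N^{\mu_\infty}$ by (fT]). The mean spectral law of a
$\mathbb{P}_N^{\mu_\varepsilon}$-distributed matrix converges for any $\varepsilon$, so the result holds by application of Lemma
12.2.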
b) Let $\mu_n$ be a sequence of infinitely divisible probability measures that converges to
a probability measure $\mu$ (by [25, Lem. 7.8], we know that $\mu$ is infinitely divisible). It is
known (see [El Th. 1, §19]) that if $(\gamma_n, \sigma_n)$ denotes the Lévy pair of $\mu_n$ and $(\gamma, \sigma)$ the one
of $\mu$, the weak convergence of $\mu_n$ to $\mu$ is equivalent to the fact that

(25) $\gamma_n \to \gamma$ and $\sigma_n(g) \to \sigma(g)$ for any continuous bounded function $g$.

Let us also consider, for each $n \ge 1$, a sequence $(U_N^{(n)})_{N \ge 1}$ of random column vectors
which satisfies Hypothesis 2.4. We suppose that there is another sequence $(U_N)_{N \ge 1}$ of
random column vectors which satisfies Hypothesis 2.4 such that the function $\Gamma^{(n)}$ associated
to $(U_N^{(n)})_{N \ge 1}$ converges pointwise to the function $\Gamma$ associated to the sequence $(U_N)_{N \ge 1}$.

Let us now prove that the sequence $\Lambda_{\Gamma^{(n)}}(\mu_n)$ converges weakly to $\Lambda_\Gamma(\mu)$.

Let us first suppose that there exists $M > 0$ such that

$$\forall n \ge 1, \quad \sigma_n([-M, M]^c) = 0.$$

Then, since the cumulants of $\mu_n$ are more or less the moments of $\sigma_n$ (see Formula (50) for
more details), we have convergence of the cumulants: for all $k$, $c_k(\mu_n) \to c_k(\mu)$ as $n \to \infty$. Hence
by Formula (12), the moments of $\Lambda_{\Gamma^{(n)}}(\mu_n)$ tend to the ones of $\Lambda_\Gamma(\mu)$. It implies the weak
convergence, since $\Lambda_\Gamma(\mu)$ is determined by its moments, as stated in Proposition 6.1.

Let us now deal with the general case. It suffices to prove that the Cauchy transform of
$\Lambda_{\Gamma^{(n)}}(\mu_n)$ converges pointwise to the one of $\Lambda_\Gamma(\mu)$. So let us fix $z \in \mathbb{C}^+$ and $\varepsilon > 0$. Define
the function $g(t) := \frac{1}{z - t}$. As in the proof of a), one can find $M$ such that $\sigma(\mathbb{R} \setminus [-M, M]) < \varepsilon$
and for all $n$, $\sigma_n(\mathbb{R} \setminus [-M, M]) < \varepsilon$. Hence with some decompositions $\mu = \mu_\varepsilon * \mu_\infty$ and
$\mu_n = \mu_{n,\varepsilon} * \mu_{n,\infty}$ as above, with $\mu_\varepsilon$ and $\mu_{n,\varepsilon}$ having some Lévy measures supported on
$[-M, M]$ and $\mu_\infty$ and $\mu_{n,\infty}$ some compound Poisson measures, we have

(26) |Ar(/x)((7) - Ar(/i.)((?)| < ^, |Ar(/i„)((?) - Ar (/.„,,) ((/)| < ^. 

This is a consequence of the resolvent identity (see Equation (48), Lemma 12.2 in the
Appendix). Moreover, by (25), $\mu_{n,\varepsilon}$ converges weakly to $\mu_\varepsilon$ as $n \to \infty$. So by what
precedes about the compact case, $\Lambda_{\Gamma^{(n)}}(\mu_{n,\varepsilon})$ converges weakly to $\Lambda_\Gamma(\mu_\varepsilon)$. Joining this to
(26), we get the desired result. □



6.3. Concentration of measure and almost sure convergence. To conclude the proof
of Theorem 2.6, we still need to prove that not only the mean spectral law of a $\mathbb{P}_N^{\mu}$-distributed
random matrix $M$ converges to $\Lambda_\Gamma(\mu)$ as $N \to \infty$, but also, almost surely, the
empirical spectral law. By the Borel-Cantelli lemma, it follows directly from what precedes
and the following lemma.

Remark 6.6 (The associated Lévy process). There exists a Lévy process $(M_s)_{s \ge 0}$ with
values in the space of $N \times N$ Hermitian matrices such that $M_s$ has distribution $\mathbb{P}_N^{\mu^{*s}}$ for
every $s \ge 0$ (this is due to the fact that the distribution $\mathbb{P}_N^{\mu}$ is infinitely
divisible, see e.g. [25, Cor. 11.6]). Let $(P_s^{(N,\mu)}, s \ge 0)$ be the corresponding semi-group:
for every Hermitian matrix $A$, every bounded function $f$:

$$P_s^{(N,\mu)}(f)(A) = \mathbb{E}[f(M_s + A)].$$

Let $\mathcal{A}^{(N,\mu)}$ be the infinitesimal generator associated to this semi-group. Its domain contains
the set of twice continuously differentiable functions on $N \times N$ Hermitian matrices vanishing
at infinity and if $(\gamma, \sigma)$ denotes the Lévy pair of $\mu$ (cf. Proposition-Definition [TT^), for
any Hermitian matrix $A$, we have, by [25, Th. 31.5],

$$\mathcal{A}^{(N,\mu)}(f)(A) = \gamma\, df(A)[D_N] + \int \Big( N\big[\mathbb{E}\big[f\big(A + \tfrac{x}{N} U_N U_N^*\big)\big] - f(A)\big] - \frac{x}{1+x^2}\, df(A)[D_N] \Big) \frac{1+x^2}{x^2}\, \sigma(dx),$$

where $df$ denotes the derivative of $f$ and $D_N = \mathbb{E}[U_N U_N^*]$. Note that for $f : \mathbb{R} \to \mathbb{R}$
extended to Hermitian matrices by spectral calculus, we have $df(A)[I_N] = f'(A)$.

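Lemma 6.7. Consider $\mu$ an infinitely divisible law and $g : \mathbb{R} \to \mathbb{R}$ a Lipschitz function
with finite total variation. Then for every $\varepsilon > 0$, there exists $\delta > 0$ such that for all $N \ge 1$,

$$\mathbb{P}\Big( \Big| \frac{1}{N} \operatorname{Tr} g(M) - \mathbb{E}\Big[\frac{1}{N} \operatorname{Tr} g(M)\Big] \Big| \ge \varepsilon \Big) \le 2 e^{-N\delta},$$

with $M$ a $\mathbb{P}_N^{\mu}$-distributed random matrix.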



Proof. This is an extension of Theorem III.4 in [16], which is established for a special
case of $U_N$. Its proof only uses the fact that $\|U_N/\sqrt{N}\|_2 = 1$ (one has to notice that in
the present paper, what plays the role of $U_N$ in [16] is $U_N/\sqrt{N}$). Theorem III.4 in [16]
can be readily extended to the case when $\mathbb{E}\|U_N/\sqrt{N}\|_2^4$ is bounded w.r.t. $N$. Then, we
conclude noticing that by exchangeability and Equation (10), $\mathbb{E}\|U_N/\sqrt{N}\|_2^4$ converges
to $\Gamma(2) + \Gamma(1,1)$. Indeed,

$$\mathbb{E}\|U_N/\sqrt{N}\|_2^4 = N^{-2}\, \mathbb{E}\Big[\sum_{i} |U_N(i)|^4 + \sum_{i \ne j} |U_N(i)|^2 |U_N(j)|^2\Big] = N^{-1}\, \mathbb{E}[|U_N(1)|^4] + (1 - N^{-1})\, \mathbb{E}[|U_N(1)|^2 |U_N(2)|^2].$$

□



7. Proof of Proposition 2.7

For each $n \ge 1$, let $m_n$ be the $n$-th moment of $\Lambda_\Gamma(\mu)$. Let $c = \inf_n \Gamma(n)^{1/n}$. It suffices to
prove that there is $\varepsilon > 0$ such that for all $n$ even,

$$m_n \ge \varepsilon^n \times n\text{-th moment of a standard Gaussian variable},$$

i.e. that for all $n$ even, $m_n \ge \varepsilon^n\, |\mathrm{Part}_2(n)|$, where $\mathrm{Part}_2(n)$ is the set of pairings of
$\{1, \ldots, n\}$. The formula of $m_n$ is given by (12), where each term is positive. Moreover,
we know that $c_2(\mu) = \mathrm{Var}(\mu) > 0$. Hence it suffices to notice that for any $k \ge 1$ and any
$\pi \in \mathrm{Part}_2(k)$, $f_\Gamma(\pi) \ge c^k$, which follows from the expression of $f_\Gamma(\pi)$ as a sum of positive
terms in (22), where the term associated to the trivial partition $\tau$ with one block is

$$\prod_{J \in \pi} \Gamma(|J|) \ge \prod_{J \in \pi} c^{|J|} = c^k.$$



8. Proof of Theorem 3.1

First, in the case where $\alpha > 1$, we can suppose the $Y_{ij}$'s to be centered. Indeed, replacing
the $Y_{ij}$'s by $Y_{ij} - \mathbb{E}[Y_{ij}]$ is a rank-two perturbation of $M_{N,p}$ and by Lemma 12.2, a rank-two
perturbation of an Hermitian matrix has no influence on the weak convergence of its
spectral law.

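Let $(P(t), t \ge 0)$ be a standard Poisson process, independent of the other variables. For
each $B > 0$, let us introduce both following approximations of the random matrix $M_{N,p}$:

$$\widetilde{M}_{N,p} := \frac{1}{a_N^2} \sum_{i=1}^{p} X_i \cdot \widetilde{V}_i \widetilde{V}_i^* \quad \text{and} \quad \widehat{M}_N := \frac{1}{a_N^2} \sum_{i=1}^{P(N\lambda)} X_i \cdot \widetilde{V}_i \widetilde{V}_i^*,$$

with $\widetilde{V}_i := (Y_{ij} \mathbb{1}_{|Y_{ij}| \le B a_N})_{j=1}^{N} \in \mathbb{K}^{N \times 1}$ (this column vector depends implicitly on $N$ and
on the cutoff parameter $B$). It is noticed in Sections 1 and 9 of [5] that for
$Z_N := \frac{\sqrt{N}}{a_N} Y_{11} \mathbb{1}_{|Y_{11}| \le B a_N}$, we have, for each $n \ge 1$,

$$\frac{\mathbb{E}[|Z_N|^{2n}]}{N^{n-1}} \longrightarrow \frac{\alpha}{2n - \alpha}\, B^{2n - \alpha}$$

and $\sqrt{N}\, \mathbb{E}[Z_N] = O(1)$ (this is where the recentering is necessary when $\alpha > 1$). So
Hypothesis 2.4 holds for $U_N$ distributed as the $\frac{\sqrt{N}}{a_N} \widetilde{V}_i$'s, with

(27) $$\Gamma(n_1, \ldots, n_k) = \Gamma^{(B)}(n_1, \ldots, n_k) := B^{2(n_1 + \cdots + n_k) - k\alpha} \prod_{\ell=1}^{k} \frac{\alpha}{2 n_\ell - \alpha}.$$

As a consequence, by Theorem 2.6, the empirical spectral law of $\widehat{M}_N$ converges almost
surely, as $N \to \infty$, to the law $\Lambda_{\Gamma^{(B)}}(\mu)$, with $\mu$ the (compound Poisson) law of $\sum_{j=1}^{P(\lambda)} X_j$.
By Lemma 12.2 (applied to $\widetilde{M}_{N,p}$ and $\widehat{M}_N$, for all $\varepsilon > 0$), the same
holds for $\widetilde{M}_{N,p}$, because by the Law of Large Numbers, as $N, p \to \infty$, with $p/N \to \lambda$,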

$$\frac{\operatorname{rank}(\widetilde{M}_{N,p} - \widehat{M}_N)}{N} \longrightarrow 0 \quad \text{almost surely}.$$

To prove the convergence of the empirical spectral law of $M_{N,p}$, by Lemma 12.2, it suffices
to prove that for any $\varepsilon > 0$, if $B$ is large enough, then almost surely, for $N, p$ large enough,

(28) $$\operatorname{rank}(M_{N,p} - \widetilde{M}_{N,p}) \le N\varepsilon.$$

But we have

$$\operatorname{rank}(M_{N,p} - \widetilde{M}_{N,p}) \le \sum_{i=1}^{p} \mathbb{1}_{V_i \ne \widetilde{V}_i}.$$

The above right-hand term is a sum of $p$ independent Bernoulli variables with parameter

$$q_N := 1 - \big(\mathbb{P}(|Y_{11}| \le B a_N)\big)^N.$$

We have, for $L$ the slow variations function introduced at (13),

$$\mathbb{P}(|Y_{11}| \le B a_N) = 1 - B^{-\alpha} a_N^{-\alpha} L(B a_N) = 1 - B^{-\alpha} a_N^{-\alpha} L(a_N) \times \frac{L(B a_N)}{L(a_N)} = 1 - \frac{B^{-\alpha}}{N}(1 + \varepsilon_N),$$

where $\varepsilon_N$ is a sequence tending to zero. As a consequence, as $N \to \infty$, $q_N$ tends to
$1 - e^{-B^{-\alpha}}$. By some Laplace transform estimates like p. 722 of [5], there is a constant $c$
such that

$$\mathbb{P}\big(\operatorname{rank}(M_{N,p} - \widetilde{M}_{N,p}) \ge 4 p q_N\big) \le e^{-cN},$$

hence by Borel-Cantelli's lemma, almost surely, for $N, p$ large enough ($N$ and $p$ grow
together in such a way that $p/N \to \lambda$), $\operatorname{rank}(M_{N,p} - \widetilde{M}_{N,p}) \le 4 p q_N$. But $1 - e^{-B^{-\alpha}} \sim B^{-\alpha}$
as $B \to \infty$, so for any fixed $\varepsilon$, one can choose $B$ such that almost surely, for $N, p$ large
enough, (28) holds. It concludes the proof of the convergence.
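The limit of $q_N$ and its behaviour for large $B$ can be illustrated numerically. The sketch below makes a simplifying assumption that is not in the text: the tail is taken to be exactly Pareto, $\mathbb{P}(|Y| > x) = x^{-\alpha}$ for $x \ge 1$ (i.e. the slowly varying function is constant), so that $a_N = N^{1/\alpha}$. The function names are ours.

```python
import math


def q_N(N, B, alpha):
    """1 - P(|Y| <= B a_N)^N for a pure Pareto tail P(|Y| > x) = x**(-alpha), x >= 1.

    Assumes B * a_N >= 1 so that the tail formula applies.
    """
    a_N = N ** (1.0 / alpha)        # solves P(|Y| > a_N) = 1/N
    tail = (B * a_N) ** (-alpha)    # P(|Y| > B a_N) = B**(-alpha) / N
    return 1.0 - (1.0 - tail) ** N


def q_limit(B, alpha):
    """Limit of q_N as N grows: 1 - exp(-B**(-alpha))."""
    return 1.0 - math.exp(-B ** (-alpha))


gap = abs(q_N(10**6, 2.0, 1.5) - q_limit(2.0, 1.5))
ratio_large_B = q_limit(100.0, 1.5) / 100.0 ** (-1.5)
```

In this toy setting `gap` is tiny for $N = 10^6$, and `ratio_large_B` is close to 1, in line with the equivalence $1 - e^{-B^{-\alpha}} \sim B^{-\alpha}$ used to choose the cutoff $B$.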

Let us now denote by $\nu$ the law of the $X_i$'s and by $\Lambda_\alpha(\nu, \lambda)$ the limit of the empirical
spectral law of $M_{N,p}$. We shall prove that this is a continuous function of $(\nu, \alpha, \lambda)$. By
the formula given for $\Gamma^{(B)}$ at (27), for any fixed cutoff $B$, the limit spectral law of $\widetilde{M}_{N,p}$
depends continuously on the parameters $(\nu, \alpha, \lambda)$. Since the cutoff necessary to obtain a
right-hand term $N\varepsilon$ in (28) can be chosen uniformly in $\alpha$ and $\lambda$ as soon as they vary in
compact sets which do not contain zero (regardless of the law of the $X_i$'s), this proves the
continuity.



9. Proof of Proposition 3.3

We first treat (together) the Gaussian and uniform cases. Exchangeability is obvious.
Let us now prove that for any diagonal matrix $D = \mathrm{diag}(\varepsilon_1, \ldots, \varepsilon_N)$ with diagonal entries
in $\{z \in \mathbb{K} \,;\, |z| = 1\}$,

(29) $$U_N U_N^* \overset{\text{law}}{=} D U_N U_N^* D^*.$$

It will imply that the expectation of ^ is null as soon as the condition on multisets given 
at Remark 2.5 is satisfied. Let $\{e_1, \ldots, e_N\}$ denote the canonical basis of $\mathbb{C}^{N \times 1}$. For each
$\ell = 1, \ldots, N$, let us denote by $\mathcal{L}_\ell$ the law of $U_N$ conditionally to the event $X_N = e_\ell$ ($X_N$
is the random variable with uniform law on $\{e_1, \ldots, e_N\}$ that was introduced after (14)).
We know that the law $\mathcal{L}$ of $U_N$ is given by

(30) $$\mathcal{L}(B) = \frac{1}{N} \sum_{\ell=1}^{N} \mathcal{L}_\ell(B)$$

for any Borel set $B \subset \mathbb{K}^N$. Hence it suffices to prove that for all $\ell$, an $\mathcal{L}_\ell$-distributed vector
$V$ satisfies $V V^* \overset{\text{law}}{=} D V V^* D^*$. So let us fix $\ell \in \{1, \ldots, N\}$, consider such a vector $V = (V(1), \ldots, V(N))^T$
and such a matrix $D = \mathrm{diag}(\varepsilon_1, \ldots, \varepsilon_N)$. Note that the entries of $V$ are independent and
that all of them except the $\ell$-th one are invariant, in law, by multiplication by any $\varepsilon_i$.
Hence for $D' = \varepsilon_\ell^{-1} I_N$, we have

$$(D'D) V \overset{\text{law}}{=} V.$$

As a consequence, we have

$$(D'D) V V^* (D'D)^* \overset{\text{law}}{=} V V^*.$$

But we also have $(D'D) V V^* (D'D)^* = D V V^* D^*$, hence (29) is proved. It remains to prove
(10) and more precisely Formulas (16), (17) and (18). So let us fix $k \ge 1$, $n_1, \ldots, n_k \ge 1$
and find out the limit, as $N \to \infty$, of

$$N^{k - (n_1 + \cdots + n_k)}\, \mathbb{E}\big[|U_N(1)|^{2n_1} \cdots |U_N(k)|^{2n_k}\big].$$

Note that by (130|) . we have 

(31) Ar'=-("i+-+«0E[|f/^(l)|2"i . . . |f/^(A;)|2"*] = 

(32) (1 - ^) Ar'=-("i+-+"fc)(l - e-*)"i+-+"^ nJ=iE[|^iP"^] + 

(33) Ejo=i A^'=-("i+-+'^*)+"^o-i(l - e-*)"i+-+"*-"^oE[|e-5 + ^|2"^o] E[|^i|2"^]. 

First, as soon as one of the $n_j$'s is $\ge 2$, the term of (32) vanishes as $N \to \infty$. The same
happens for each of the terms of the sum (33) if two of the $n_j$'s are $\ge 2$. Equation (10)
follows easily.

Let us now consider the case where $U_N = U_t$, for $(U_t/\sqrt{N})_{t \ge 0}$ a solution of (15) whose
initial law is the uniform law on the canonical basis. Let us introduce the $N \times N$ matrix
$P_t$ (the dependence on $N$ is implicit in $P_t$ and in $U_t$)

$$P_t := U_t U_t^* / N.$$

By a direct application of the matricial Itô calculus (see [HI Sect. 2.1]), the process $(P_t)_{t \ge 0}$
satisfies the SDE

(34) $$dP_t = (dK_t) P_t - P_t (dK_t) + \Big(\frac{1}{N} I - P_t\Big) dt.$$

First, from this SDE, it follows easily that $\operatorname{Tr} P_t = 1$ for all $t$, so that $\|U_t\|^2 = N$. Second,
it is easy to see that the law of $K$ is invariant under conjugation by any unitary matrix.
By uniqueness, in law, of the solutions of (15) and (34), given the initial conditions, it
follows that for any permutation matrix $Q$ and any matrix $D$ as in (29), $Q U_t \overset{\text{law}}{=} U_t$ and
$D P_t D^* \overset{\text{law}}{=} P_t$. It remains to prove (10) and Formulas (16), (17) and (18). For each
$n_1, \ldots, n_k \ge 0$, let us define

$$\widetilde\Gamma_t(n_1, \ldots, n_k) := N(N-1) \cdots (N - p + 1)\, \mathbb{E}\big[|U_t(1)|^{2n_1} \cdots |U_t(k)|^{2n_k}\big]\, N^{-(n_1 + \cdots + n_k)},$$

where $p = |\{\ell \,;\, n_\ell \ne 0\}|$ (the dependence of $\widetilde\Gamma_t$ on $N$ is implicit). We have to prove that

$$\widetilde\Gamma_t(n_1, \ldots, n_k) \underset{N \to \infty}{\longrightarrow} \Gamma_t(n_1, \ldots, n_k).$$

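Due to exchangeability and to $\|U_t\|^2 = N$, we have $\widetilde\Gamma_t(1) = 1$ and

(35) $$\widetilde\Gamma_t(n_1, \ldots, n_k) = \widetilde\Gamma_t(n_1 + 1, n_2, \ldots, n_k) + \cdots + \widetilde\Gamma_t(n_1, \ldots, n_{k-1}, n_k + 1) + \widetilde\Gamma_t(n_1, \ldots, n_k, 1),$$

and since all the $|U_t(i)|^2/N$'s are $\le 1$, if one considers two families $n_1, \ldots, n_k \ge 1$ and $n'_1, \ldots, n'_k \ge 1$
such that $n_j \le n'_j$ for all $j$, then

(36) $$\widetilde\Gamma_t(n_1, \ldots, n_k) \ge \widetilde\Gamma_t(n'_1, \ldots, n'_k).$$

It is easy to see that thanks to (35) and (36) and to the obvious formula $\widetilde\Gamma_t(1) = 1$, it
suffices to prove that

(37) $$\widetilde\Gamma_t(2,2) \underset{N \to \infty}{\longrightarrow} 0 \quad \text{and} \quad \widetilde\Gamma_t(n) \underset{N \to \infty}{\longrightarrow} e^{-nt} \ \text{ for all } n \ge 2.$$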



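For any multiset $\{k_1, \ldots, k_n\}$ of integers in $\{1, \ldots, N\}$, let us define

$$\gamma_t(k_1, \ldots, k_n) := \prod_{j=1}^{n} e_{k_j}^* P_t\, e_{k_j} = \prod_{j=1}^{n} \frac{|U_t(k_j)|^2}{N}$$

(the dependence of $\gamma_t(\cdots)$ on $N$ is implicit). By exchangeability, $\mathbb{E}[\gamma_t(k_1, \ldots, k_n)]$ only
depends on the level sets partition of the map $j \mapsto k_j$. To prove (37), we have to prove

(38) $$N^2\, \mathbb{E}[\gamma_t(1,1,2,2)] \underset{N \to \infty}{\longrightarrow} 0$$

and

(39) $$N\, \mathbb{E}[\gamma_t(\underbrace{1, \ldots, 1}_{n \text{ times}})] \underset{N \to \infty}{\longrightarrow} e^{-nt} \quad \text{for all } n \ge 2.$$

Recall that $\{e_1, \ldots, e_N\}$ denotes the canonical basis of $\mathbb{C}^{N \times 1}$. For each $k \ge 1$,

$$d\gamma_t(k) = e_k^* (dK_t) P_t e_k - e_k^* P_t\, dK_t\, e_k + \Big(\frac{1}{N} - \gamma_t(k)\Big) dt.$$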

As an application of the matricial Itô calculus (see [01 Sect. 2.1]), we get the quadratic
variation dynamics

$$d\langle \gamma_t(k), \gamma_t(\ell) \rangle = -\frac{2}{N}\big[\gamma_t(k)\gamma_t(\ell) - \mathbb{1}_{k = \ell}\, \gamma_t(k)\big]\, dt.$$

By Itô's formula for products of semimartingales, it follows that $\mathbb{E}[\gamma_t(k_1, \ldots, k_n)]$ is a
smooth function of $t$ which satisfies the differential system

(40) $$\partial_t\, \mathbb{E}[\gamma_t(k_1, \ldots, k_n)] = -n\Big(1 + \frac{n-1}{N}\Big) \mathbb{E}[\gamma_t(k_1, \ldots, k_n)] + \frac{1}{N} \sum_{j=1}^{n} \big(2\,|\{i \,;\, i < j,\ k_i = k_j\}| + 1\big)\, \mathbb{E}[\gamma_t(k_1, \ldots, \widehat{k_j}, \ldots, k_n)],$$

where we used the convention $\gamma_t(k_1, \ldots, k_n) = 1$ if $n = 0$. Let us now prove (39). For each
$n \ge 1$, we define

$$q_t(n) = N e^{nt}\, \mathbb{E}[\gamma_t(\underbrace{1, \ldots, 1}_{n \text{ times}})]$$

(the dependence of $q_t(n)$ on $N$ is implicit). Note first that since $\|U_t\|^2 = N$, for all $n \ge 1$,

$$\mathbb{E}[|U_t(1)|^{2n}]\, N^{-n+1} \le \mathbb{E}[|U_t(1)|^2] = 1,$$

so that $q_t(n) \le e^{nt}$. Moreover, by the differential system of (40), we have, for all $n \ge 2$,

+ ^E(2j-3)gt(n-l). 



dtqt{n) 



N 



-qt[n) 



i=l 



It follows that $\partial_t q_t(n)$ converges to zero uniformly on every compact subset of $\mathbb{R}_+$ as
$N \to \infty$. Since for $t = 0$, $q_t(n) = 1$ for each $n$, we get that for each $n \ge 2$,

$$q_t(n) \underset{N \to \infty}{\longrightarrow} 1,$$

so that (39) is proved. Let us at last prove (38). Thanks to the differential system of (40)
and to the fact that $\mathbb{E}[\gamma_t(1, 1, 2)] = \mathbb{E}[\gamma_t(1, 2, 2)]$, we have

dt¥. [7i(l, 1, 2, 2)] = -4 f 1 + E [7i(l, 1, 2, 2)] + |-E [7^(1, 1, 2)] 



A^ 



Due to exchangeability and to \\Ut\\ = N, both E [7^(1, 1,2)] and E [7^(1, 1,2)] are not 
greater than ]y(]^3T)- Therefore, we get: 



|9,(ArV*E[7t(l,l,2,2)])|<^ 



Since $\gamma_0(1, 1, 2, 2) = 0$, this proves (38).



□ 



10. Proof of Proposition 3.4

Let $\pi \in \mathrm{Part}(k)$ and $\Gamma := \Gamma_t$; we have to prove that $f_{\Gamma_t}(\pi) = e^{-t\kappa(\pi)}$. By the very
definition of $\kappa$, it is obvious that $\kappa(\pi) = \kappa(\mathrm{thin}(\pi))$ and that $\kappa$ is additive on the connected
components of $\pi$. Therefore, due to Lemma 6.3, it is enough to prove that $f_{\Gamma_t}(\pi) = e^{-t\kappa(\pi)}$
for $\pi$ thin and connected. Note that in this case, $\kappa(\pi)$ is equal to $k$.

We start from Formula (22) for $f_{\Gamma_t}(\pi)$ and are going to prove that the only non-vanishing
term in this formula corresponds to the trivial partition $\tau$ with only one block. This will
imply:

$$f_{\Gamma_t}(\pi) = \prod_{J \in \pi} \Gamma_t(|J|) = \prod_{J \in \pi} e^{-t|J|} = e^{-tk},$$

and this will prove the proposition.

For $k = 1$, it is trivial. Let us suppose $k > 1$. Since $\pi$ is thin and connected, it has a
crossing (and this implies in fact $k \ge 4$). Let us consider a non trivial $\tau$ (i.e. with at least
two blocks) such that $H := H(\pi, \tau)$ has no cycle. Let $c$ be a coloring of the edges of $G/\pi$
with kernel $\tau$.

Since $H$ is connected, there exist two edges $E$ and $E'$ of $H$ with a non-empty intersection.
Let $J_0 \in \pi$ be such that $\{J_0\} = E \cap E'$ (since $H$ has no cycle, this intersection is a
singleton). Since $\pi$ is thin, by Proposition 5.1, $E$ is not a singleton. Let $J'_0 \in E \setminus \{J_0\}$.
Since the graph $G/\pi$ is a circuit, there exist at least one path from $J_0$ to $J'_0$ and one path
from $J'_0$ to $J_0$. But if they were the only ones, there would be no crossing between $J_0$
and the other blocks of $E$, and $\pi$ would not be connected (cf. for instance the (counter-)example
page 15). Therefore there exist at least two paths from $J_0$ to $J'_0$ and two paths
from $J'_0$ to $J_0$. Hence we deduce that $n_c^+(J_0, i) \ge 2$ for a certain colour $i$ (see Section 5.2 for
the definition of $n_c^+(J_0, i)$). With the same argument, we get $n_c^+(J_0, j) \ge 2$ for a certain
$j \ne i$. This implies that

$$\Gamma_t(n_\tau(J_0)) = \Gamma_t\big(n_c^+(J_0, \ell),\ \ell \ge 1\big) = 0.$$

Hence, $\prod_{J \in \pi} \Gamma_t(n_\tau(J)) = 0$, and this implies that the only $\tau$ with a non-vanishing contribution
to the computation of $f_{\Gamma_t}(\pi)$ is the trivial one with only one block. □

11. Proof of Proposition 3.5

Lemma 11.1. For any Hermitian matrices $A, B$ with $B$ positive, any column vector $V$,
any constant $c \in \mathbb{R}$,

(41) $\mathbb{E}\big[(A + iB - cX_t VV^*)^{-1}\big] = (A + iB + it|c|VV^*)^{-1}$

with $X_t$ a random variable with law $C_t$.
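Identity (41) can be checked numerically on a small case. The sketch below is only an illustration and rests on an assumption not restated in this section: $C_t$ is taken to be the Cauchy law with scale $t$, i.e. the law of $t\tan\theta$ for $\theta$ uniform on $(-\pi/2, \pi/2)$. The helper names (`inv2`, `shifted`, `cauchy_mean`) are hypothetical, and the matrices are $2 \times 2$ so that no linear-algebra library is needed.

```python
import math

def inv2(m):
    """Inverse of a 2x2 complex matrix given as ((a, b), (c, d))."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return ((d / det, -b / det), (-c / det, a / det))

def shifted(A, B, V, coeff):
    """Return (A + iB + coeff * V V*)^{-1} for 2x2 A, B and a length-2 vector V."""
    m = tuple(
        tuple(A[r][s] + 1j * B[r][s] + coeff * V[r] * V[s].conjugate()
              for s in range(2))
        for r in range(2)
    )
    return inv2(m)

def cauchy_mean(A, B, V, c, t, n_nodes=4000):
    """Midpoint-rule approximation of E[(A + iB - c X V V*)^{-1}].

    Assumption: X = t * tan(theta), theta uniform on (-pi/2, pi/2),
    i.e. X is Cauchy with scale t.
    """
    acc = [[0j, 0j], [0j, 0j]]
    for j in range(n_nodes):
        theta = -math.pi / 2 + (j + 0.5) * math.pi / n_nodes
        r = shifted(A, B, V, -c * t * math.tan(theta))
        for a in range(2):
            for b in range(2):
                acc[a][b] += r[a][b] / n_nodes
    return acc

# Illustrative data: A Hermitian, B positive definite, V a complex vector.
A = ((1.0, 0.5), (0.5, -2.0))
B = ((1.0, 0.0), (0.0, 2.0))
V = (1.0 + 0j, 1j)
c, t = -0.7, 1.3

lhs = cauchy_mean(A, B, V, c, t)
rhs = shifted(A, B, V, 1j * t * abs(c))   # right-hand side of (41)
max_gap = max(abs(lhs[a][b] - rhs[a][b]) for a in range(2) for b in range(2))
```

The integrand is a bounded rational function of $x$, hence smooth and periodic in $\theta$, which is why a plain midpoint rule is accurate here; `max_gap` should be at the level of rounding error.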

Proof. One can suppose $V$ to have unit norm. First, for $f$ a bounded rational function
with non-positive degree whose poles are all in $\mathbb{C}^+ := \{z \in \mathbb{C};\ \Im(z) > 0\}$, $C_t(f) = f(-ti)$
(it can be proved via the residue formula if f{z) = and can then be generalized by 
density). Second, for Q the orthogonal projector on V-^, 

$$\det(A + iB - cxVV^*) = \big(\langle V, (A + iB)V\rangle - cx\big)\,{\det}_{|Q}\big(Q(A + iB)Q\big)$$

with ${\det}_{|Q}$ the determinant on the space $V^\perp$. If we suppose $c > 0$, this implies that the
entries of $(A + iB - cxVV^*)^{-1}$ are rational functions of $x$ with poles in $\mathbb{C}^+$. The equality
Proof of Proposition 3.5. Let X^, . . . , X^'^, Uj^, Ujj,... be as in Theorem 2.1 with
X* ~ . If we apply repeatedly the preceding equality, we obtain that for any Hermitian 

n 

matrix A, z G C+, c G M: 

E [{zl^ -A- {XlUM* + ■■■ + X!^^U^-V^n/Nr'\Uh, • • • , U^""] 

riN 

jjl ryl*! \_TTNnTTNn* 

When n — ^ oo, Jv ^ tends a.s. to E [f/ivf/]^] = In by the law of large 

numbers, and {X^^UjfUj; + ■ ■ ■ + X^''U^'^U^''*)/N tends in law to Pj^^^ following Theorem 
12.11 Therefore, it is now easy to deduce that: 

$$\mathbb{E}\big[(zI_N - A - M_t)^{-1}\big] = (zI_N + itI_N - A)^{-1} = P_t(f_z)(A)$$

with $f_z(x) = (z - x)^{-1}$. This establishes the proposition for $f = f_z$. Using routine density
and linearity arguments, this is enough to prove the proposition. □



12. Appendix 

12.1. Matrix approximations in the sense of the rank. The Cauchy transform of a
finite measure $\mu$ on $\mathbb{R}$ is $G_\mu(z) := \int \frac{\mathrm{d}\mu(t)}{z - t}$, for $z \in \mathbb{C}^+$.

Lemma 12.1. Let $\mu_N$ be a sequence of random probability measures on the real line such
that for any $\varepsilon > 0$, there is another sequence $\mu_N^\varepsilon$ converging weakly to a deterministic
probability measure $\mu^\varepsilon$ and such that almost surely, for $N$ large enough, for any $z \in \mathbb{C}^+$,

(42) $|G_{\mu_N}(z) - G_{\mu_N^\varepsilon}(z)| \le \dfrac{\varepsilon}{\Im z}.$

Then $\mu_N$ converges weakly to a deterministic probability measure $\mu = \lim_{\varepsilon \to 0} \mu^\varepsilon$.

Proof. Almost surely, by (42), for any $z \in \mathbb{C}^+$, the sequence $G_{\mu_N}(z)$ is a Cauchy sequence,
hence converges to a deterministic limit $G(z)$ such that for any $\varepsilon$,

(43) $|G(z) - G_{\mu^\varepsilon}(z)| \le \dfrac{\varepsilon}{\Im z}.$

So by [2, Th. 2.4.4 b)], $\mu_N$ converges vaguely to a measure $\mu$ with total mass $\le 1$ such that
$G_\mu = G$ and $\mu = \lim_{\varepsilon \to 0} \mu^\varepsilon$. To see that $\mu$ is a probability measure, it suffices to notice
that for any measure $\nu$, $-y\,\Im(G_\nu(iy))$ increases to $\nu(\mathbb{R})$ as $y \to \infty$, and then to use (43). □
For $M$ a random $N \times N$ Hermitian matrix with eigenvalues $\lambda_1, \ldots, \lambda_N$, recall that the
empirical spectral law of $M$ is the random probability measure $\mu_M$ defined by

(44) $\mu_M := \dfrac{1}{N} \sum_{i=1}^{N} \delta_{\lambda_i},$

and that the mean spectral law of $M$ is the deterministic probability measure $\bar\mu_M$ defined,
for any bounded Borel function $f$, by

(45) $\bar\mu_M(f) := \mathbb{E}[\mu_M(f)],$

with $\mu_M$ as in (44).

Lemma 12.2. For each $N$, let $M_N$ be an $N \times N$ random Hermitian matrix. Suppose that
for any $\varepsilon > 0$, there is a sequence $M_N^\varepsilon$ of random Hermitian matrices whose empirical (resp.
mean) spectral law converges almost surely (resp. converges) to a deterministic probability
measure $\mu^\varepsilon$ and such that almost surely, for $N$ large enough, $\operatorname{rank}(M_N - M_N^\varepsilon) \le N\varepsilon$ (resp.
such that for $N$ large enough, $\mathbb{E}[\operatorname{rank}(M_N - M_N^\varepsilon)] \le N\varepsilon$). Then the empirical (resp. mean)
spectral law of $M_N$ converges almost surely (resp. converges) to a deterministic probability
measure $\mu = \lim_{\varepsilon \to 0} \mu^\varepsilon$.



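Proof. Let us first prove the almost sure version of the lemma. For any Hermitian
matrix $H$ and any $z \in \mathbb{C}^+$, we have $\|(z - H)^{-1}\|_{\mathrm{op}} \le (\Im z)^{-1}$. So by the formula

$$(z - B)^{-1} - (z - A)^{-1} = (z - B)^{-1}(B - A)(z - A)^{-1},$$

which shows that the difference of the two resolvents has rank at most $\operatorname{rank}(B - A)$, while its
operator norm is at most $2(\Im z)^{-1}$, we have

(46) $\big|\operatorname{Tr}(z - M_N)^{-1} - \operatorname{Tr}(z - M_N^\varepsilon)^{-1}\big| \le 2(\Im z)^{-1} \times \operatorname{rank}(M_N - M_N^\varepsilon)$

and for any $\varepsilon$, with probability one, for $N$ large enough, for all $z \in \mathbb{C}^+$,

$$\Big|\tfrac{1}{N}\operatorname{Tr}(z - M_N)^{-1} - \tfrac{1}{N}\operatorname{Tr}(z - M_N^\varepsilon)^{-1}\Big| \le \frac{2\varepsilon}{\Im z}.$$

It follows that the sequences $\mu_N := \mu_{M_N}$, $\mu_N^\varepsilon := \mu_{M_N^\varepsilon}$ satisfy the hypotheses of the
previous lemma, which allows us to conclude.

Let us now prove the expectation version of the lemma. Taking the expectation of (46),
we get

$$\big|G_{\bar\mu_{M_N}}(z) - G_{\bar\mu_{M_N^\varepsilon}}(z)\big| \le \frac{2\varepsilon}{\Im z}.$$

It follows that the (non-random) sequences $\mu_N := \bar\mu_{M_N}$ and $\mu_N^\varepsilon := \bar\mu_{M_N^\varepsilon}$ satisfy the
hypotheses of the previous lemma, which allows us to conclude. □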



12.2. Infinitely divisible distributions and cumulants. 
12.2.1. The classical case. Let us recall the definition of infinitely divisible laws with
respect to the classical convolution $*$ (see [27]).

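Proposition-Definition 12.3. Let $\mu$ be a probability measure on $\mathbb{R}$. Then the following
are equivalent:

(i) there is a sequence $(k_n)$ of positive integers tending to $+\infty$ and a sequence $(\nu_n)$ of
probability measures on the real line such that as $n \to \infty$,

(47) $\underbrace{\nu_n * \cdots * \nu_n}_{k_n \text{ times}} \longrightarrow \mu,$

(ii) there is a continuous semigroup $(\mu^{*t})_{t \in [0,\infty)}$ for the convolution $*$ starting at $\delta_0$ such
that $\mu^{*1} = \mu$,

(iii) the Fourier transform of $\mu$ has the form $\int_{\mathbb{R}} e^{it\xi}\,\mathrm{d}\mu(t) = e^{\Psi_\mu(\xi)}$, with

(48) $\Psi_\mu(\xi) = i\gamma\xi + \displaystyle\int_{\mathbb{R}} \Big(e^{it\xi} - 1 - \frac{it\xi}{1 + t^2}\Big)\frac{1 + t^2}{t^2}\,\mathrm{d}\sigma(t),$

where the integrand is defined to be $-\xi^2/2$ for $t = 0$, $\gamma$ being a real number and $\sigma$ being a
finite positive measure on $\mathbb{R}$.

In this case, $\mu$ is said to be $*$-infinitely divisible; the pair $(\gamma, \sigma)$, called the Lévy pair of
$\mu$, is unique, and $\mu$ will be denoted by $\nu_*^{\gamma,\sigma}$.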



Part (i) of the above definition characterizes such laws as the limit laws of sums of
i.i.d. random variables. Part (ii) expresses them as the distributions of one-dimensional
marginals of Lévy processes, and Part (iii) is known as the Lévy-Khintchine formula. The
pair $(\gamma, \sigma)$ can be interpreted as follows: $\gamma$ is a drift, $\sigma(\{0\})$ represents the Brownian
component of the Lévy process associated to $\mu$, and the measure $\mathbf{1}_{x \ne 0}\frac{1 + x^2}{x^2}\,\mathrm{d}\sigma(x)$, when it
is finite, represents its compound Poisson part (when this measure is not finite, the Lévy
process can be understood as a limit of such decompositions).

12.2.2. The free case. In [131 [El, Bercovici and Voiculescu have proved that
Proposition-Definition 12.3 stays true if one replaces the classical convolution $*$ by the free additive
convolution $\boxplus$ and Formula (48) by the following formula for the $R$-transform of $\mu$:

(49) $R_\mu(z) = \gamma + \displaystyle\int_{\mathbb{R}} \frac{z + t}{1 - tz}\,\mathrm{d}\sigma(t)$

(recall that the $R$-transform of $\mu$ is defined by the formula $R_\mu(z) = G_\mu^{-1}(z) - \frac{1}{z}$, see [2^ l2]).
In this case, $\mu$ is said to be $\boxplus$-infinitely divisible and is denoted by $\nu_\boxplus^{\gamma,\sigma}$.

The map $\Lambda : \nu_*^{\gamma,\sigma} \in \{*\text{-infinitely divisible laws}\} \mapsto \nu_\boxplus^{\gamma,\sigma} \in \{\boxplus\text{-infinitely divisible laws}\}$ is
called the Bercovici-Pata bijection. In (see also [IZ]), Bercovici and Pata proved that
this bijection preserves limit theorems, i.e. that for any sequence $(k_n)$ of positive integers
tending to infinity, for any sequence $(\nu_n)$ of laws on $\mathbb{R}$, for any $*$-infinitely divisible law $\mu$,
we have

$$\underbrace{\nu_n * \cdots * \nu_n}_{k_n \text{ times}} \longrightarrow \mu \iff \underbrace{\nu_n \boxplus \cdots \boxplus \nu_n}_{k_n \text{ times}} \longrightarrow \Lambda(\mu).$$

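12.2.3. The cumulants point of view. Let $\mu$ be a probability measure on $\mathbb{R}$ whose Laplace
transform is defined in a neighborhood of zero. Its classical cumulants are the numbers
$(c_n(\mu))_{n \ge 1}$ defined by the formula

$$\log \int_{\mathbb{R}} e^{\xi t}\,\mathrm{d}\mu(t) = \sum_{n \ge 1} \frac{c_n(\mu)}{n!}\,\xi^n \qquad (\xi \in \mathbb{C} \text{ small enough}).$$

In the same way, for $\mu$ a compactly supported probability measure on $\mathbb{R}$, the free
cumulants of $\mu$ are the numbers $(k_n(\mu))_{n \ge 1}$ defined by the formula

$$R_\mu(z) = \sum_{n \ge 1} k_n(\mu)\,z^{n-1} \qquad (z \in \mathbb{C} \text{ small enough}).$$

It can easily be seen (see e.g. [HI Eq. (2.1)]) that for any Lévy pair $(\gamma, \sigma)$, the classical
(resp. free) cumulants of $\nu_*^{\gamma,\sigma}$ (resp. of $\nu_\boxplus^{\gamma,\sigma}$) are given by the formula

(50) $k_n(\nu_\boxplus^{\gamma,\sigma}) = c_n(\nu_*^{\gamma,\sigma}) = \begin{cases} \gamma + \int t\,\mathrm{d}\sigma(t) & \text{if } n = 1, \\ \int t^{n-2}(1 + t^2)\,\mathrm{d}\sigma(t) & \text{if } n \ge 2. \end{cases}$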

Hence the Bercovici-Pata bijection can be seen as (the continuous extension of) the map 
which transforms classical cumulants into free ones. 
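Formula (50) can be made concrete for a purely atomic $\sigma$. The sketch below is an illustration under stated assumptions: $\sigma$ is a finite sum of point masses, the function names are hypothetical, and the passage from cumulants to moments uses the standard moment-cumulant recursions (classical: $m_n = \sum_{j=1}^{n} \binom{n-1}{j-1} c_j m_{n-j}$; free: $m_n = \sum_{s=1}^{n} k_s\,[z^{n-s}]M(z)^s$ with $M(z) = \sum_i m_i z^i$), which are not stated in the text above.

```python
from math import comb

def levy_pair_cumulants(gamma, atoms, n_max):
    """Cumulants c_1..c_{n_max} given by formula (50).

    Assumption: sigma = sum of w * delta_t over the (t, w) pairs in `atoms`.
    """
    cums = [gamma + sum(t * w for t, w in atoms)]
    for n in range(2, n_max + 1):
        cums.append(sum(t ** (n - 2) * (1 + t * t) * w for t, w in atoms))
    return cums

def classical_moments(cums):
    """Moments m_0..m_n of the law whose classical cumulants are `cums`."""
    m = [1.0]
    for n in range(1, len(cums) + 1):
        m.append(sum(comb(n - 1, j - 1) * cums[j - 1] * m[n - j]
                     for j in range(1, n + 1)))
    return m

def _mul_trunc(p, q, deg):
    """Product of two polynomials (coefficient lists), truncated at degree deg."""
    out = [0.0] * (deg + 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            if i + j <= deg:
                out[i + j] += a * b
    return out

def free_moments(cums):
    """Moments m_0..m_n of the law whose free cumulants are `cums`."""
    m = [1.0]
    for n in range(1, len(cums) + 1):
        total, power = 0.0, [1.0]          # power holds M(z)^s, truncated
        for s in range(1, n + 1):
            power = _mul_trunc(power, m, n - 1)
            total += cums[s - 1] * power[n - s]
        m.append(total)
    return m

# sigma = delta_0, gamma = 0: cumulants (0, 1, 0, 0, ...).
gauss = levy_pair_cumulants(0.0, [(0.0, 1.0)], 6)
# sigma = (1/2) delta_1, gamma = 1/2: all cumulants equal to 1.
poisson = levy_pair_cumulants(0.5, [(1.0, 0.5)], 4)
```

With these inputs, `classical_moments(gauss)` gives the moments of the standard Gaussian law and `free_moments(gauss)` those of the semicircle law with variance one, while the second Lévy pair pairs the Poisson law of parameter 1 with the free Poisson (Marchenko-Pastur) law of parameter 1: the same cumulant sequence is read classically on one side and freely on the other.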



12.3. Combinatorial definitions. The definitions and results of this section can be found
in the classical book [15] of Berge.

12.3.1. Graphs. A graph $G = (V, E)$ is characterized by a set $V$ of vertices and a family
$E = (e_i, i \in I)$ of edges:

(1) If the edges are elements of $V \times V$, $G$ is a directed graph;

(2) If the edges are singletons or pairs of $V$, $G$ is a non-directed graph.

Note that multiple edges are possible with this definition, since $E = (e_i, i \in I)$ is a family
and not a set, which means that the $e_i$'s are not necessarily pairwise distinct.



In the following three definitions, we suppose G non-directed. 
(1) A connected component of $G$ is a block of the smallest partition of $V$ which is above
$E$ w.r.t. the refinement order.

(2) A path in $G$ is a sequence $(v_0, e_{i_1}, v_1, e_{i_2}, \ldots, e_{i_k}, v_k)$ with $k \ge 1$, $v_0, \ldots, v_k \in V$,
$e_{i_j} = \{v_{j-1}, v_j\}$. If, moreover, $v_0 = v_k$, then the path is said to be closed.
In this case, if $v_0, \ldots, v_{k-1}$ are pairwise distinct and $i_1, \ldots, i_k$ are pairwise distinct,
the path is said to be a cycle.

(3) The cyclomatic number of $G$ is equal to $|I| - |V| + n$, with $n$ the number of
connected components of $G$.



The following proposition can easily be proved by induction on the number of vertices 
of G. 

Proposition 12.4. For every non directed graph G, its cyclomatic number is non-negative. 
It vanishes if and only if G has no cycle. 
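The cyclomatic number is easy to compute directly from its definition. The following sketch is only an illustration: the function name `cyclomatic_number` and the encoding of edges as tuples of length one (a singleton edge, i.e. a loop) or two are assumptions made here, and connected components are counted with a small union-find structure.

```python
def cyclomatic_number(vertices, edges):
    """Return |I| - |V| + n for a non-directed multigraph.

    `edges` is a list (a family, so repeats are allowed) of tuples of
    length 1 (a singleton edge) or 2 (a pair of vertices).
    """
    parent = {v: v for v in vertices}

    def find(v):
        # Follow parent pointers to the representative, halving the path.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for e in edges:
        root = find(e[0])
        for v in e[1:]:
            parent[find(v)] = root      # merge the endpoints' components

    components = len({find(v) for v in parent})
    return len(edges) - len(parent) + components

triangle = cyclomatic_number([1, 2, 3], [(1, 2), (2, 3), (3, 1)])
tree = cyclomatic_number([1, 2, 3], [(1, 2), (2, 3)])
double_edge = cyclomatic_number([1, 2], [(1, 2), (1, 2)])
```

Here `tree` is 0 (no cycle), while `triangle` and `double_edge` are both 1, in line with Proposition 12.4: a repeated edge already creates a cycle of length two because edges form a family rather than a set.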

The previous definitions extend easily to the framework of a directed graph $G = (V, E =
(e_i, i \in I))$. For example, a path in $G$ is a sequence $(v_0, e_{i_1}, v_1, e_{i_2}, \ldots, e_{i_k}, v_k)$ with $k \ge 1$,
$v_0, \ldots, v_k \in V$, $e_{i_j} = (v_{j-1}, v_j)$. In a directed graph, two cycles $c, c'$ are said to be disjoint
if they do not have any edge in common, i.e. if there is no $i \in I$ such that $e_i$ appears both
in $c$ and in $c'$ (whereas they can have some vertices in common). We say that the directed
graph $G$ is a circuit if a closed path of $G$ visits each of its edges exactly once (vertices can
be visited more than once). One can easily prove (by induction) that any circuit is a
union of disjoint cycles.



12.3.2. Hypergraphs. A hypergraph is a pair $H = (V, E)$, where $V$ is a set (the vertices) and
$E = (E_i, i \in I)$ is a family (the edges) of non-empty subsets of $V$ such that $\bigcup_{i \in I} E_i = V$.
An example is given in Figure 10(a). If the edges have only one or two elements, then $H$
reduces to a non-directed graph.



(a) A hypergraph (b) A hypergraph with no cycle 

Figure 10. Two examples of hypergraphs 
The refinement order for hypergraphs defined on the same set of vertices is defined as the
refinement order on their sets of edges.

A connected component of $H$ is a block of the smallest partition of $V$ which is above $E$
w.r.t. the refinement order. For instance, there is only one connected component in the
hypergraph of Figure 10(a), hence it is said to be connected.



Cycles are defined in hypergraphs as in non-directed graphs, except that cycles with
length one are not accepted (in fact, they do not make sense): a cycle of $H$ is a sequence
$(v_0, E_{i_1}, v_1, E_{i_2}, \ldots, v_{\ell-1}, E_{i_\ell}, v_\ell)$ with $\ell \ge 2$, $v_0, v_1, \ldots, v_{\ell-1}$ pairwise distinct, $v_0 = v_\ell$,
$i_1, \ldots, i_\ell$ pairwise distinct and $v_{j-1}, v_j \in E_{i_j}$ for all $j$.



An example of a hypergraph with no cycle is given in Figure 10(b). Notice that a
hypergraph with no cycle is linear: two different edges have at most one vertex in common.
Notice also that a hypergraph with only one vertex or only one edge has no cycle.

Proposition 12.5. Let $n$ be the number of connected components of $H$. Then:

• $\sum_{i \in I} |E_i| - |I| - |V| + n \ge 0$;

• $\sum_{i \in I} |E_i| - |I| - |V| + n = 0$ if and only if $H$ has no cycle.

Proof. This is a standard result (see for instance [15, Ch. 7, Prop. 4]), which is a
consequence of Proposition 12.4. For the reader's convenience, let us recall the short proof. We
associate to $H$ a non-directed graph $G(H)$ with vertices $V \cup I$, and with an edge between
$x \in V$ and $i \in I$ if and only if $x \in E_i$. The graph $G(H)$ is simple, i.e. there cannot be
more than one edge between two of its vertices. Also, it has no loop, i.e. no edge from one
vertex to itself. The number of edges is equal to $\sum_{i \in I} |E_i|$, and the number of connected
components is $n$, so the cyclomatic number of $G(H)$ is equal to $\sum_{i \in I} |E_i| - (|V| + |I|) + n$.
It is non-negative, and vanishes if and only if $G(H)$ has no cycle, which means exactly that
$H$ has no cycle. □
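The proof translates directly into a computation on the incidence graph $G(H)$. The sketch below is an illustration only: the function name `hypergraph_cycle_rank` and the encoding of a hypergraph as a vertex list plus a list of vertex sets are assumptions made here. It returns $\sum_{i \in I} |E_i| - |I| - |V| + n$, which by Proposition 12.5 is zero exactly when $H$ has no cycle.

```python
def hypergraph_cycle_rank(vertices, edges):
    """Cyclomatic number of the incidence graph G(H) of a hypergraph H.

    `edges` is a list of non-empty sets of vertices whose union is `vertices`.
    Nodes of G(H) are tagged ("v", x) for vertices and ("e", i) for edges.
    """
    parent = {("v", x): ("v", x) for x in vertices}
    parent.update({("e", i): ("e", i) for i in range(len(edges))})

    def find(node):
        # Representative of the connected component containing `node`.
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    incidences = 0
    for i, edge in enumerate(edges):
        for x in set(edge):
            incidences += 1                       # one edge of G(H) per x in E_i
            parent[find(("v", x))] = find(("e", i))

    components = len({find(node) for node in parent})
    return incidences - len(parent) + components

no_cycle = hypergraph_cycle_rank([1, 2, 3, 4], [{1, 2, 3}, {3, 4}])
not_linear = hypergraph_cycle_rank([1, 2, 3, 4], [{1, 2, 3}, {2, 3, 4}])
```

In the second example the two edges share two vertices, so the hypergraph is not linear and the quantity is positive; in the first it vanishes, as expected for a hypergraph with no cycle.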